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Methods and Materials for Identifying Inhibitors Of Molecular 
5 Interactions Mediated by SH3 Domains 

Technical Field 

This invention relates to materials and methods for identifying inhibitors of 
molecular interactions mediated by SH3 domains, and for compositions and uses 
1 0 relating to such inhibitors. 

Introduction 

A fundamental goal of recent medical and pharmaceutical research has been the 
elucidation of mechanisms of diseases and the identification of critical points within 

1 5 such mechanisms for effective pharmaceutical intervention. Unfortunately, in some 

cases the disease mechanism has not yet been sufficiently defined to permit precise 
intervention, while in other cases gaps in technology have hampered the research 
effort. 

One mechanistic aspect of cellular function in both normal and disease states 

2 0 which has attracted increasing attention is cellular signal transduction, the series of 

events leading from extracellular events to intracellular sequelae. Numerous proteins 
that function as signal transducing molecules have been identified. These include 
receptor and non-receptor tyrosine kinases, phosphatases and other molecules with 
enzymatic or regulatory activities. One feature common to many of these molecules is 

2 5 the capacity to associate specifically with other proteins to form a signaling complex 

that can alter cell activity. 

Signaling proteins often contain domain(s) of conserved sequence, which serve 
as non-catalytic modules that direct protein-protein interactions during signal 
transduction. Two such domains have been termed the src homology domain 2 (SH2) 

3 0 and src homology domain 3 (SH3). These domains are relatively small (-100 amino 

acids for SH2 and ~ 60 amino acids for SH3) and are found in various combinations 
and locations in different proteins. For example, some members of the src-family of 
tyrosine kinases, e.g., Abl, GRB2 and PI3K each contain both SH2 and SH3 domains. 
In contrast, the tyrosine kinase Syk has two SH2 domains, while two components of 
3 5 NADPH oxidase, p47-phox and p67-phox, each have two SH3 domains. The 

presence of multiple SH2 and /or SH3 domains within a protein increases the variety 
of potential protein-protein interactions. 
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Aspects of the structures of various SH2 domains are known and certain 
aspects of their role in signal transduction is becoming better understood. SHE 
domains direct the association of specific proteins by binding selectively and with 
specificity to protein sequences containing phosphotyrosine. For example, upon 
5 ligand binding, the PDGF fi-receptor dimerizes and autophosphorylates multiple 
tyrosine residues. This phosphorylation triggers the physical association of SH2- 
containing proteins such as c-src, PLC-gamma, PI3K and ras-GAP with the receptor, 
forming a signaling complex. An analysis of SH2 binding using natural ligands 
containing mutations at residues surrounding the site of phosphorylation as well as a 
1 0 screen of combinatorial peptide libraries using SH2 domains has revealed the 
specificity of SH2 binding. 

Less is known about ligand specificity for SH3 domains. Initial inroads towards 
understanding SH3 ligand specificity came first from a screen of a cDNA expression 
library using the c-abl SH3 domain as a probe. Two proteins were identified, termed 

1 5 3BP-1 and 3BP-2. Short proline-rich sequences within these proteins were 

demonstrated as important for c-abl SH3 binding. 

Pharmaceutical agents which interfere with the formation or stability of signaling 
complexes formed by proteins containing one or more SH3 domains and their natural 
ligands could be used to treat or prevent the diseases or their pathological effects 

2 0 mediated by such complexes — but only if these interfering agents have sufficient 

binding specificity. Unfortunately, too little has been known about the natural ligands 
and the binding specificities of the various SH3 domains to know whether selective 
interference with SH3 binding would actually be possible, and if it were, to permit the 
identification or design of agents capable of so acting. Indeed, the limited apparent 

2 5 sequence requirements (proline rich sequences) for binding to SH3 domains together 

with the limited expected affinities of interactions therewith were considered 
discouraging, especially in comparison to emerging information for SH2 domains. 

Summary of the Invention 

3 0 The present inventors have discovered that protein-protein interactions 

mediated by SH3 domains are characterized by a surprisingly high level of binding 
specificity, far beyond a simple requirement for "proline rich ,/ ligands. Indeed, the 
present inventors have found that peptide ligands (including "synthetic" peptide 
ligands, i.e., ligands containing amino acid sequences not heretofore found in nature) 
3 5 can be identified which bind to a particular SH3 domain selectively and with high 

affinity. They have developed a general methodology for identifying specific peptide 
ligands for SH3 domains and have applied it to representative SH3 domains for 
which they identified selective, high affinity ligands. 



WO 95/24419 J PCT/US95/03208 

Using this technology for identifying specific ligands for SH3 domains, the 
present inventors have devised in vitro and in vivo assays for identifying inhibitors of 
SH3-mediated interactions. The materials and method used in those assays provide 
for high-throughput identification of SH3 inhibitors. 
5 These discoveries thus permit, for the first time, the practical identification of 

compounds which specifically block the signaling of an SH3-bearing protein of 
interest and which may be used as pharmaceutical agents. 

Interestingly, the present inventors' initial success in identifying specific binding 
partners for individual SH3 domains hinged on a serendipitous polymerase accident, 
1 0 discussed in greater detail below. That accident led to the first positive results 

demonstrating the feasibility of their overall approach. Those and subsequent results, 
combined with a novel extension of BIAcore® technology— which itself should be of 
widespread interest and applicability — confirmed the significance of their findings, 
and again, the feasibility of their overall approach. 

1 5 This invention provides materials and methods, both in vitro and in vivo, for 

identifying substances which bind selectively to an SH3 domain of interest (i.e., to a 
protein or peptide, preferably of human origin or sequence, containing an SH3 domain 
of interest) or which selectively block or otherwise inhibit the interaction of proteins 
mediated by one or more SH3 domains. 

2 0 This invention thus provides a reproducible, generally applicable method for 

obtaining an SSL which binds selectively to an SH3 domain of interest and further 
provides illustrative peptide SSLs for representative SH3 domains. The SSLs, fusion 
proteins containing an amino acid sequence corresponding to that of an SSL and 
DNA or RNA molecules encoding them are useful in the present methods for 

2 5 identifying SH3 binding agents and SH3 blocking agents. 

SSLs may be identified by combining an SH3 domain of interest {e.g. in the form 
of a peptide, protein, protein fragment or fusion protein ) with one or more candidate 
ligands [e.g. a peptide (including peptide SSLs which contain an amino acid sequence 
not present in a naturally occurring protein), peptoid, protein, protein fragment, 

3 0 fusion protein, non-peptide backbone displaying "side chains" of natural amino or 

imino acids, nucleic acid or any hybrid of the preceding] under conditions permitting 
a ligand to bind to an SH3 domain to form an SH3-ligand complex, then detecting 
which if any candidates 1) bound to the SH3 domain or 2) blocked binding the SH3 
domain to a known SH3 ligand. This methodology may be applied to SH3 domains 
3 5 present in Abl, Src, Grb-2, Nek or p85 PI 3-K, as well as to SH3 domains from other 
proteins. A candidate ligand which bound to the SH3 domain may be recovered from 
the SH3-ligand complex and further characterized. This can be accomplished, e.g. by 
separating unbound candidate ligands from any SH3 domain-ligand complexes so 
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formed, dissociating the complexed ligand(s) from the SH3 domain-ligand complexes, 
enriching for the selected ligands by re-contacting the dissociated ligands with the 
SH3 domain under conditions permitting a ligand to bind to an SH3 domain to form 
a complex, and detecting which any candidate ligands bound to the SH3 domain. The 
enriched ligand(s) may be dissociated from the complexes so formed and may then be 
structurally characterized, if desired. Peptide ligands comprising an amino acid 
sequence not present in a naturally occurring protein are of special interest, especially 
in cases in which the SH3 domain is other than an Abl or p85 PI 3-Kinase SH3 
domain. A ligand so selected may be re-assayed as above, but against different SH3 
domains, to determine the presence and/or extent of binding of the ligand to a 
different SH3 domain — i.e., to confirm the selectivity of the interaction with the SH3 
domain against which it was identified. Peptide SSLs may thus be obtained 
containing 7 amino acid residues or more, and as disclosed below, may in some 
applications contain at least 10 amino acids residues, and in some cases as least 12, 
14 or 15 amino acid residues. Peptide SSLs may be so obtained for SH3 domain- 
containing proteins include (1) members of the src-family protein tyrosine kinases 
(Src, Lyn, Fyn, Lck, Hck, Fgr, Yes), (2) Grfc-2, which has two SH3 domains, (3) Sprk, a 
threonine/serine protein kinase, (4) Tsk, (5) Btk, (6) Txk and other Tec family 
members, (7) Vav, (8) GTPase Activating Protein (GAP), (9) p40, p47, and p67 
proteins of the neutrophil oxidase complex, and (10) p85 phosphatidylinositol 3' 
kinase, (11) Crk, (12) phospholipase C gamma, and (13) Abl. Ligands containing 
peptide sequences not found in naturally occurring proteins may be selected for SH3 
domains of Src, Lyn, Fyn, Yes, Txk, Tsk, Btk, NADPH oxidase p47-phox and p67- 
phox, Crk and other proteins. DNAs encoding such peptide ligands may be readily 
prepared using conventional methods and materials. 

For example, a mixture of candidate ligands, preferably peptides, may be 
contacted with an SH3 domain of interest and incubated under conditions permitting 
the formation of an SH3-ligand complex. Uncomplexed candidate ligands and SH3- 
ligand complex(es) are separated from each other, and ligand(s) bound to the SH3 
domain are separately recovered and identified. A preferred approach is disclosed 
infra in which the SH3 domain of interest, in the form of a fusion protein, is used to 
select peptide ligands from combinatorial libraries displayed on phage. Sequences so 
identified may be re-assayed as mentioned above, whether in the form of phage- 
displayed peptides, as part of any recombinant protein or as synthetic peptides. 

SSLs so obtained may then be used as follows to identify SH3 binding or 
blocking agents. Generically stated, the method of this invention employs: (1) a 
peptide which contains an SH3 domain of interest, (2) a specific SH3 ligand (SSL) of 
this invention (which in in vitro embodiments can be a peptide or nonpeptide, but is 
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preferably not a protein) which is capable of selectively binding to the SH3 domain of 
interest to form an SH3-SSL complex and (3) a compound (the "test substance") to 
be evaluated for its ability to bind competitively to a selected SH3 domain, or block 
molecular interactions mediated by that SH3 domain, in a specific and selective 
5 manner. The terms "peptide" and "protein" are used interchangeably herein, except 
in those instances in which an explicit distinction has been drawn. The SSL may be a 
peptide SSL comprising a peptide sequence which is present in a naturally occurring 
protein or a peptide sequence which is not present in a naturally occurring protein. 
The method is carried out by combining the three components mentioned above, or 
1 0 compositions comprising them; incubating the resulting test mixture under conditions 
permitting the formation of an SH3-SSL complex; and measuring the ability of the 
test substance to compete with the SSL for binding to the SH3 domain or to otherwise 
block the formation or reduce the observed level of SH3-SSL complex. This method is 
a powerful and general method, and should be applicable to any SH3 domain and 

1 5 susceptible to variety of configurations, including both in vitro and in vivo formats. 

Depending on the specific assay configuration, it may be important to use known 
concentrations of SH3 domain, SSL and /or test substance. For comparative 
purposes, the assay may also be carried out in the absence of the test substance or in 
the presence of varying concentrations of test substance. One may carry out the 

2 0 measuring step by assaying for SH3-SSL complex, non-complexed SSL and/or non- 

complexed test substance or by measuring the occurence of an event mediated by the 
presence or formation of the SH3-SSL complex or an SH3-test substance complex. 

The test substance may be present in a solution, referred to as a test solution. 
Alternatively, especially (but not exclusively) for in vitro assays, the test substance 

2 5 may be present in a test mixture comprising an emulsion, suspension or other mixture; 

exposed on the surface of a cell, virus, phage, etc.; or immobilized on a solid support. 

In an in vitro format, a binding assay is conducted to identify a compound 
capable of binding to the SH3 domain in the presence of a specific SH3 ligand for 
that SH3 domain or otherwise capable of blocking the formation or reducing the 

3 0 observed level of SH3-SSL complex. In one embodiment, the binding assay is a 

competitive binding assay in which the three components are combined and 
incubated under conditions permitting the formation of an SH3-SSL complex. The 
ability of the test substance to bind to the selected SH3 domain or otherwise block 
the SH3-mediated interaction in the presence of the specific SH3 ligand is 
3 5 determined. Binding to the SH3 domain or otherwise blocking the SH3-mediated 
interaction may be measured directly or indirectly (e.g., BIAcore® and other SPR 
technologies (BIAtechnology Handbook, Pharmacia Biosensor AB, Uppsala, Sweden, 
1994), fluorescence anisotropy and allied technologies (Luminescent Spectroscopy of 
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Proteins, 164pp, E. A. Permyakov, CRC Press, Inc, Boca Raoton, FL, 1992), flow 
cytometry and allied technologies (Flow Cytometry and Cell Sorting, 223pp., A. 
Radbruch, ed., Springer-Verlag, New York, NY, 1992), ELISA, RIA and allied 
methodologies (An Introduction to Radioimmunoassays and Related Techniques, 290 pp., 
5 T. Chard, Elsevier Science Publishers, Amsterdam, The Netherlands, 1990), 

competitive and non-competive affininty interactions (Immobilized Affinity Ligand 
Techniques, 454 pp., G. T. Hermanson, A. K. Mallia and P.K. Smith, eds., Academic 
Press, Inc., San Diego, CA, 1992). If binding of the SH3 domain and the specific SH3 
ligand occurs to a lesser extent in the presence of the test substance than in its 
1 0 absence, for instance, if the presence of the test substance reduces the concentration 
of SH3-SSL complex or increases the concentration of non-complexed SSL, then the 
test substance is an SH3 binding or blocking agent. If the structure of the SH3 binding 
agent so identified is not yet known, the compound may then be isolated from the 
other assay components and characterized. It may be re-evaluated, if desired, using 

1 5 one or more different SSLs to the same SH3 domain and/or assayed in similar 

binding assays with different SH3 domains to confirm the selectivity of the 
interaction with the SH3 domain with which it was identified. If desired, the binding 
of the SH3 binding agent to the SH3 domain with which it was identified may be 
characterized biochemically, e.g. through the use of BIAcore® technology, described in 

2 0 greater detail below. The SH3 binding agent so identified may be assayed in an in 

vivo SH3 blocking assay as described below and may further be evaluated for 
pharmacological activity in various in vitro and/or in vivo assays, as desired. 

in vivo assays can be conducted in analogous manner using cells containing the 
SH3 domain of interest and a ligand therefor. The cells are cultured or maintained in 

2 5 a medium suitable for cell growth. The test substance is added to the cells, e.g. to the 

medium in which the cells are cultured, and the culture is incubated under conditions 
permitting formation of a complex between the SH3 domain of interest and its ligand. 
If binding of the SH3 domain and its ligand occurs to a lesser extent in the presence of 
the test substance than in its absence, for instance, if the presence of the test 

3 0 substance reduces the concentration of SH3-ligand complex or increases the 

concentration of non-complexed ligand, then the test substance is an SH3 binding or 
blocking agent. The presence or absence of SH3-ligand complex may be measured 
directly or indirectly (e.g., by measuring the occurence of an event mediated by the 
presence or formation of the SH3-ligand complex or an SH3-test substance complex). 
3 5 An illustrative in vivo format relies upon genetically engineered cells capable of 

expressing a reporter gene under SH3-mediated transcriptional control. These cells 
contain an SH3 domain of interest and the corresponding SSL, each in the form of a 
distinct fusion protein. Each such fusion protein thus comprises, among other 
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component regions, at least one SH3 domain or SSL sequence. The SH3 fusion protein 
and SSL fusion protein are capable of forming a complex with each other. The cells 
express the reporter gene unless a substance is present which binds to the relevant 
SH3 domain or otherwise blocks the SH3-mediated interaction required for 
5 transcription of the reporter gene. In this assay, the cells are cultured or maintained in 
a suitable culture medium to establish a base-line for expression of the reporter gene. 
The test substance is added to the culture medium and the ability of the test 
substance to inhibit expression of the reporter gene is measured. If the level of 
reporter gene expression is reduced in the presence of the test substance, the test 
1 0 substance is an SH3 blocker with respect to the SH3 domain involved in 

transcriptional control. If the structure of the SH3 blocking agent so identified is not 
yet known, the compound may then be isolated from the other assay components and 
characterized. It may be re-evaluated, if desired, using engineered cells containing a 
fusion protein with one or more different SSLs to the same SH3 domain and /or 

1 5 assayed with cells containing fusion proteins comprising different SH3 domains and 

corresponding SSL sequences to confirm the selectivity of the interaction with the SH3 
domain with which it was identified. If desired, the binding affiinity of the SH3 
blocking agent for the SH3 domain with which it was identified may be determined, 
e.g. such as through the use of BIAcore® technology, described in greater detail below. 

2 0 The SH3 blocking agent so identified may be assayed in an in vitro SH3 binding assay 

as described above and may further be evaluated for pharmacological activity in 
various in vitro and /or in vivo assays, as desired. 

Specific SH3 ligands and SH3 binding or blocking agents identified by the 
present method can be identified from peptide libraries as well as from test 

2 5 substances obtained from a wide variety of sources including, e.g., microbial broths; 

cellular extracts; conditioned media from cell lines or from host cells transformed 
with genetic libraries; collections of synthetic compounds; combinatorial libraries or 
synthetic programs based on conventional medicinal chemistry approaches or 
structure-based drug design. 

3 0 This invention thus provides a means for identifying selective SH3 binding or 

blocking agents. As noted at the outset, SH3 domains are present in a wide variety of 
proteins, including proteins involved in intracellular signal transduction pathways, 
proteins associated with the cytoskeleton and adapter proteins. Accordingly, SSLs 
and specific SH3 binding or blocking agents identified through this invention may be 
3 5 useful for a variety of purposes. First, they may be useful as biological reagents in 
assays as described herein for functional classification of an SH3 domain of a 
particular protein, particularly a newly discovered protein. Families or classes of 
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SH3-bearing proteins may now be defined functionally, with respect to ligand 
specificity. 

Moreover, SH3 binding or blocking agents of this invention can be used to inhibit 
the occurrence of biological events resulting from molecular interactions mediated by 
5 an SH3 domain. This invention thus provides a method and reagents for inhibiting 
(totally or partially) the interaction between a protein containing an SH3 domain and 
a natural ligand thereto (i.e., a protein which normally binds in a cell to the SH3- 
bearing protein) or a biological activity mediated by such interaction. In this method, 
an SH3 binding or blocking agent, such as an SSL or an SH3 binder or blocker 
1 0 identified by this invention, including a peptide of 4-50 amino acid residues, more 

preferably 7-15 amino acid residues in length, is combined or contacted with the SH3 
domain-containing protein, such as by introducing the SH3 binding or blocking agent 
into a cell in which the SH3-mediated interaction is to be inhibited. Following 
introduction of the SH3 binding or blocking agent, the interaction of the SH3 domain- 

1 5 bearing protein and its natural ligand is inhibited as may be readily detected. 

Inhibiting such interactions can be useful in research aimed at better understanding 
the biology of SH3-mediated events. 

Such SH3 binding or blocking agents would be useful, for example, in the 
diagnosis, prevention or treatment of conditions or diseases resulting from a cellular 

2 0 processes mediated by an SH3-based interaction. For example, a patient can be treated 

to prevent the occurence or progression of osteoporosis, osteomalacia, or Paget's disease, or to 
reverse the course of these diseases by administering to the patient in need thereof an 
SH3 binding or blocking agent which selectively binds src SH3. There are many other 
conditions for which SH3 binding or blocking agents can be used therapeutically, 

2 5 including restenosis, rheumatoid arthritis, gout, asthma, emphysema, immune 

vasculitis, ulcerative colitis, psoriasis and acute respiratory distress syndrome, in 
which an SH3 of neutrophil oxidase p47 and p67 complex has been implicated. 
Other relevant conditions include chronic myelogenous leukemia, in which case SH3 
domains of Grb-2 are targeted. It has recently been shown that the BCR-abl oncogene 

3 0 in CML participates in the ras pathway for growth stimulation through its interaction 

with Grb-2. In these cells, inhibition of the interaction of Grb-2 SH3 domains with the 
SOS oncogene will block its ability to stimulate cell proliferation. Still other relevant 
conditions include cancers such as breast cancer, glioblastomas, head and neck 
tumors and ovarian tumors, for which the SH3 domain of Grb-2 would be targeted. 
3 5 For example, tumors with associated amplification of receptors for EGF and PDGF 

could be inhibited by blocking activation of the Ras pathway through inhibition of the 
interaction between Grb-2 (SH3)and Ras. Furthermore, since the SH3 domain of Src 
family kinases are believed to be involved in activation of T-cells, B-cells, mast cells, 
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and NK cells and since the SH3 domains of the tyrosine kinases Tsk and Btk are 
believed to be involved in T-cell (Tsk SH3) and B-cell (Btk SH3) function an SH3 
binding or blocking agent of this invention could be administered to a patient in need 
thereof to suppress immune function. 
5 An SH3 binding or blocking agent identified by the method of this invention can 

be formulated into a pharmaceutical composition containing a pharmaceutically 
acceptable carrier and/or other excipient(s) using conventional materials and means. 
Such a composition can be administered to an animal, either human or non-human, 
for therapy of a disease or condition resulting from cellular events involving an SH3- 
1 0 mediated protein-protein interaction. Administration of such composition may be by 
any conventional route (parenteral, oral, inhalation, and the like) using appropriate 
formulations as are well known in this art. The SH3 binding or blocking agent of this 
invention can be employed in admixture with conventional excipients, ie, 
pharmaceutically acceptable organic or inorganic carrier substances suitable for 

1 5 parenteral administration. 

Brief Description of the Figures 

Figure 1 (SEQ ID NOS: 1 to 64) shows the amino acid sequences (SEQ ID 
NOS: 3 to 64) of peptides identified by screening a phage display library, designated 

2 0 the X 6 PPIP library (SEQ ID NO: 3) (containing the sequences XXXXXXPPIP, where X 

is any amino acid), with Src, Fyn, Lyn, PI3K, Abl, Tsk and Yes GST-SH3 fusion pro- 
teins and with GRB2 (which contains two SH3 domains). The peptides displayed on 
the surface of M13 bacteriophage were inserted into the amino terminus of M13 gene 
III sequences (two residues from the amino terminus) such that all displayed peptides 

2 5 had the gene III residues alanine-glutamic acid at their N-terminus followed by the 

rest of gene HI sequences. Amino acids that are in italics are genelll residues. Isolates 
selected from the X6PPIPG library are listed. The amino acids that are underlined 
were inserted into genein, adjacent to the six randomized residues. Amino acid 
preferences are highlighted in bold type. Numbers in parenthesis denote isolates that 

3 0 have the same nucleotide sequence. The isolate from the first library (containing the 

glycine to arginine change) is also shown. 

Figure 2 (SEQ ID NOS: 65 to 113) shows the amino acid sequences of 
peptides identified by screening phage display libraries, RSLRPLXXXXXX (SEQ ID 
NOS: 65 to 99), PPPYPPXXXXXX (SEQ ID NOS: 100 to 108) and RLYRPLXXXXXX 
3 5 (SEQ ID NOS: 109 to 113) with various SH3 domains. 

Figure 3. Binding of phage-displayed peptides to Src and Abl SH3 domains. 
Binding of phage-displayed peptides was measured using BIAcore® technology as 
described below. Interaction between the receptor (immobilized SH3 domain) and ligand 
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(phage-displayed peptide) is indicated by an increase in RU value. Traces have been 
corrected for bulk refractive index contributions of the unbound phage. Upper right 
panel: Binding of "RSSLRPLPPIP phage" to immobilized GST-Src SH3. Upper left 
panel: Binding of "RSSLRPLPPIP phage" to immobilized GST-Abl SH3. Lower left 
5 panel: Binding of "PPPYPPPPIP phage" to immobilized GST-Src SH3. Lower right 
panel: Binding of "PPPYPPPPIP phage" to immobilized GST-Abl SH3. These data 
illustrate the selectivity of binding of the phage-borne SSL sequences for the SH3 domains 
with which they were selected. 

Figure 4 (SEQ ID NOS: 133 to 156) shows the amino acid sequences (SEQ ID 
1 0 NOS: 134 to 156) of peptides identified by screening a phage display library, 

designated the XXXXXRRLPPLPPP library (SEQ ID NO: 133), with GST-Src, Lyn and 
PI3K SH3 fusion proteins. 

Figure 5 (SEQ ID NOS: 114 to 132) shows the amino acid sequences (SEQ ID 
NOS: 115 to 132) of peptides identified by screening a phage display library, 

1 5 designated the RSLRPLPPLPXXXXX library (SEQ ID NO: 114), with GST-Src, Lyn, 

Fyn, Yes and PI3K SH3 fusion proteins. 

Figure 6 shows BIAcore® binding analysis of six phage displayed peptides to 
immobilized GST-Src or the GST-Lyn SH3 domains. Included in this analysis are the 
phage that display Src selected sequences (5) (SEQ ID NO: 149), Lyn selected 

2 0 sequences (6) (SEQ ID NO: 141), two phage which display core consensus sequences 

(1, 2) (SEQ ID NO: 8 and SEQ ID NO: 6), and two samples which did not have the 
arginine residue as part of the core consensus sequence and were correctly predicted 
to bind with lower affinity (3, 4) (SEQ ID NO: 39 and SEQ ID NO: 24). 

Figure 7 shows the amino acid sequences of peptides identified by screening a 

2 5 phage display library, designated the GAAPPLPPRXXXXX library, with GST-Src, 

Lyn, Fyn, Yes and PI3K SH3 fusion proteins. 

Figure 8 shows the amino acid sequences of peptides identified by screening a 
phage display library, designated the random lOmer library (XXXXXXXXXX), using 
NH2 terminal Crk, Lyn, Src and Fyn GST-SH3 fusion proteins. 

3 0 In Figures 1-8 (and elsewhere), the three-letter symbols represent the following 

amino acids: ala = alanine; arg = arginine; asn = asparagine; asp = aspartic acid; asx 
= Asn and/or Asp; cys = cysteine; gin = glutamine; glu = glutamic acid; gly = glycine; 
his = histidine; ile = isoleucine; leu = leucine; lys = lysine; met = methionine; phe = 
phenylalanine; pro = proline; ser = serine; thr = threonine; trp = tryptophan; tyr = 
3 5 tyrosine; val = valine. 

Figure 9. Construction of GAL4-SrcSH3 DNA binding domain expression 
vector. A segment of DNA encoding amino acids 84-145 of human Src SH3 is 
amplified by PCR using two primers, SH3PCR1 and SH3PCR2. The amplified DNA 
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is cleaved with restriction enzymes EcoRI and BamHI then ligated into the vector 
pSG424 (Sadowski and Ptashne, Nuc. Acids Res. (1989)18, 7539). Upon 
introduction into the appropriate cells, this vector produces a fusion protein 
consisting of the N-terminal portion of GAL4 (amino acids 1-147) fused to the Src 
5 SH3 domain. In the figure, the black box in front of the GAL4 portion represents the 
promoter region required for expression. Other vector elements, such as selectable 
markers, are not shown. 

Figure 10. Construction of SSL-VP16 activation domain vector. Two 
synthetic oligonucleotides are annealed to form a double stranded DNA fragment 
1 0 encoding an SSL. The SSL DNA is ligated into the vector pCGN (Tanaka and Herr, 
Cell (1990) 60, 375-386). Upon introduction into the appropriate cells, this vector 
produces a fusion protein consisting of the SSL fused to the transcriptional activation 
domain of Herpes Simplex Virus VP16. In the figure, the black box in front of the SSL 
portion represents the promoter region required for expression. Other vector elements, 

1 5 such as selectable markers, are not shown. 

Figure 11: A system for identification of compounds that interfere with SH3 
molecular interactions. To screen for compounds that specifically interfere with SH3- 
mediated interactions, the two vectors from Fig 9 and Fig 10 are introduced into 
appropriate cells (by way of example, an not limitation, mammalian cell lines termed 

2 0 COS, CHO, Jurkat, Daudi or 293 cells) where both the GAL4-SH3 and SSL-VP16 fusion 

proteins will be produced. The cells contains a reporter gene under the control of 
GAL4. In the absence of competing molecules, the association of SH3 and the SSL 
will promote the stimulation of transcription through the GAL4 and VP16 partners. 
This results in expression of the reporter gene which can be detected by various 

2 5 methods as described in the text. Compounds that interfere with the SH3-SSL 

interaction will be detected as a reduced (or in some cases) increased level of 
reporter. Certain other reporter systems will promote cell growth in the absence (or in 
other cases in the presence) of an interfering compound. This system is not limited to 
mammalian cells; it may be readily adapted using appropriate expression vectors for use 

3 0 with yeast, insect cells, or bacteria. In addition, this sytstem is not limited to intact cells 

but may also be accomplished using cellular extracts or in vitro 

transcription/translation. Panel A: Representations of molecules in panels B-D. Panel 
B: A cell line expressing GAL4-SH3 and SSL-VP16. In the absence of a test 
compound, GAL4-SH3 binds to SSL-VP16 to promote expression of reporter gene. 
3 5 Panel C: In the same cell line as panel B, the presence of a test compound that binds 
to SH3 prevents association with SSL-VP16. The reporter gene is not expressed. 
Panel D: A control cell line expressing the GAL4-VP16 is used to test the ability of 
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the test compound to interact with other cellular components. The preferred SH3 
inhibitory compound would not affect reporter gene expression in the control cell line. 

Figure 12. Alignment of 17 SH3 domains. The SH3 domain regions from 17 
proteins were aligned by the Megalign program (DNAstar, Madison WI) using the 
5 clustal method with a PAM250 weight table. Amino acids that are conserved in at 
least 10 out of the 17 proteins are shown above the sequences as Consensus #1. Two, 
amino acids, W and G, designated by arrows, are found in all 17 SH3 proteins. 
Dashes represent no amino acid present were included where insertions in other SH3 
domains occur. The consensus homology domain, is defined as the sequence from the . 
1 0 first conserved amino acid (A) through the last conserved amino acid (V). All protein 
listed are from human except YSCH8263_5, a sequence from the yeast S. cerevisiae, 
that was included to demonstrate the homology between human and yeast SH3-like 
domains. 

Figure 13. Sequence Pair Distances for 17 SH3 Domains. An analysis of 

1 5 percent similarity (and percent divergence) was tabulated using the Megalign 

program. The highest percent similarity between two SH3 domains is for Lck and Lyn 
(45.1%). The lowest percent similarity is between p40 and p85A (11%). These data 
were generated using a PAM250 residue weight table. 

Figure 14 shows BIAcore® analysis of the ability of one SH3 ligand (SOS- 

2 0 derived peptide Ac-EVPVPPPVPPRRRPGGK-NH2; solid spheres •) and one SSL 

(Ac-SHRLYRPLPPLPGGK-NH2; hollow spheres o) to competitively inhibit the 
formation of SH3 /ligand complex as described in Example 5. Assays were initiated 
by combining luM Grb-2(P204L) SH3 domain with various concentrations (100, 50, 
25, 10, 5, 2.5, 1 or 0 uM) of peptides and pre-incubated at 25°C for >30 minutes. 

2 5 Complex formed between the SH3 domain and the immobilized ligand was measured 

using the BIAcore Biosensor® (Pharmacia Biosensor, Piscataway, NJ). 

Detailed Description of the Invention 

3 0 Identification of SH3 or SH3-like Domains 

The term "SH3-like domain or a subdomain thereof" refers to a sequence which 
is substantially homologous to a Src homology region 3 (SH3 region), or a subdomain 
of an SH region preferably a conserved region of an SH region. The Src homology 
region is a noncatalytic domain of -60 amino acids which was originally identified in 
3 5 the viral Fps and viral Src cytoplasmic tyrosine kinases by virtue of its effects on both 
catalytic activity and substrate phosphorylation (T. Pawson, Oncogene 3, 491 (1988) 
and I. Sadowski et al., Mol. Cell. Biol. 6, 4396 (1986)). SH3 domains have been found 
in a variety of eukaryotic proteins, some of which function in intracellular signal 
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transduction. Examples (including counterparts from various species) of SH3 
domain-containing proteins include (1) members of the src-family protein tyrosine 
kinases (Src, Lyn, Fyn, Lck, Hck, Fgr, Yes), (2) Grb-2, which has two SH3 domains, 
(3) Sprk, a threonine /serine protein kinase, (4) Tsk, (5) Btk, (6) Txk and other Tec 
5 family members (see e.g. Haire et al, Hum Mol Genet 30, 897 (1994), (7) Vav, (8) 
GTPase Activating Protein (GAP), (9) p40, p47, and p67 proteins of the neutrophil 
oxidase complex, and (10) phosphatidylinositol 3' kinase, (11) Crk, (12) 
phospholipase C gamma, (13) Abl. SH3 domain-containing proteins have been 
identified in human, rodent, bovine, C. elegans, and yeast. 
1 0 In Figure 12, primary amino acid sequences of a selection of 17 known SH3 

domains have been aligned by the "clustal method" using multiple alignment software 
from DNAstar (Madison, WI). As shown in Figure 12, residues which are conserved 
in 10 out of 17 sequences are listed above the alignment as Consensus #2. This criteria 
identified 18 consensus amino acids and the region of each protein including these 

1 5 amino acids is know as the consensus homology domain.. The length of the consensus 

homology domain— that is, the number of amino acids from the first conserved amino 
acid of the domain (alanine, A) to the last conserved amino acid of the domain 
(valine, V) varies from 48 amino acids for Grb2 (C-terminal domain), p47 (C-terminal 
comain), p67 (both domains) to 65 amino acids for p85 (PB'kinase). However, other 

2 0 SH3 consensus regions may be longer or shorter. Insertions or deletions compared to 

Src SH3 domain can occur anywhere within the consensus region. For example, the 
PI3K SH3 domain, neuronal Src SH3 domain and Sprk SH3 domains contain inserted 
amino acids between the VL and WW consensus motifs. All 17 of the SH3 domains 
shown in figure X contain a conserved tryptophan (W) and glycine (G) within the 

2 5 domain. Some SH3 domains, such as human Sprk SH3, contain all 18 consensus 

amino acids. Others contain 9-17 of the consensus amino acids. Such an analysis may 
be useful in identifying new SH3 or SH3-like domains from unknown DNA, RNA, or 
protein sequences. For example, a protein coding sequence can be aligned with the 18 
consensus amino acids, or Src SH3 or any of the other SH3 or SH3-like domains and 

3 0 scored for the presence of the consensus amino acids. An SH3-like domain may be 

recognized by having between greater than 7 of these conserved amino acids but SH3- 
like domains may have fewer than 7 conserved amino acids. An alternative method 
for identification of SH3-like domains is through the use of sequence distance 
calculations such as shown in Figure 13. This figure shows the percent similarity 
3 5 between pairs of the 17 SH3 domain sequences in the alignment. The highest sequence 
identity among this set of SH3 domains was Lck SH3 compared to Lyn SH3 with a 
%sequence identity of 45.1. Two SH3-like domains may exhibit higher or lower values 
than shown here. The lowest value was 11.0% between p40 and p85 SH3 domains. 
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Abbreviations for the amino acid residues are: A, Ala; C, Cys; D, Asp; E, Glu; 
F, Phe; G, Gly; H, His; I, lie; K, Lys; L, Leu; M, Met; N, Asn; P, Pro; Q, Gin; R, Arg; S, 
Ser; T, Thr; V, Val; W, Trp; and Y, Tyr. 

Certain SH3 or SH3-like domains may not match any of the 18 conserved 
5 amino acids nor exhibit significant homology with known SH3 domain sequences to 
be detected by computer alignment programs. These sequences may, nevertheless, 
exhibit the same or similar three-dimensional structure as known SH3 domains and 
function as an SH3-like domain. The three-dimensional structure of several known 
SH3 domains have been determined. SH3 domains are characterized as two anti- 
1 0 parallel beta sheets composed of 5 or 6 beta strands. Regions forming an alpha helix 
may or may not be present within the domain. SH3 or SH3-like domains may be 
recognized as having an SH3-like domain structure when solved by x-ray 
crystallography or NMR spectroscopy. Alternatively, a predicted structure by 
homology modeling may be used to identify a particular protein sequence as an SH3- 

1 5 like domain. 

The identification of SH3 or SH3-like domains may be accomplished by 
screening a cDNA expression library with an SSL for known SH3 domains to isolate 
cDNAs for SH3 proteins. One could use PCR or low stringency screening with an 
SH3-specific probe. The SH3 domain or protein containing the SH3 domain may be 

2 0 isolated from naturally occuring sources (e.g. cells, tissues, organs, etc); produced 

recombinantly in bacteria, yeast or eukaryotic cells; produced in vitro using cell free 
translation systems; or produced synthetically (e.g. peptide synthesis). 

The subject invention is relevant to SH3 and SH3-like domains as described in 
the foregoing paragraphs. 

25 

Further definitions 

The following additional terms are defined in the interest of clarity and to aid 
in understanding the invention. 

3 0 A specific SH3 ligand or SSL is a ligand which binds selectively to an SH3 domain 

of interest. 

As used herein, "selective" binding to, blocking or inhibiting means "capable of 
binding to, blocking or inhibiting more effectively with respect to the SH3 domain of 
interest than with respect to at least one other SH3 domain". This selectivity may be 
3 5 manifested biochemically as an increased on-rate (K aS sociation) and /or a decreased 
off-rate (Kdisssociation) for the SH3 domain of interest, e.g. as determined by kinetic 
methods including BIAcore® analysis or as a lower Kd as measured by any 
conventional binding assay methodology or as a larger Gibbs free energy of binding 
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(Go) as measured or calculated by standard methodologies. Alternatively, this 
selectivity may be manifested as a lower IC50 of a test substance for competitive 
inhibition of binding of an SH3 ligand to an SH3 domain of interest, whether 
determined in vitro or in cell-based assays (e.g., via expression of a reporter gene) or a 
5 lower ED50 of a test substance in a cell, organ or animal model where 1) the 

involvement of an SH3 target in the outcome of the assay can be demonstrated; and 
2) binding of the test substance to the SH3 target can be demonstrated in vitro by any 
of the above means. In particular cases, the degree of selectivity is preferably at least 
2-fold, and even more preferably at least 3-fold, as measured by any of the foregoing 
1 0 types of measurement. Degrees of selectivity of greater than about 5-fold and greater 
than about 10-fold and higher are achievable in individual cases. 

A substance identified by the methods of this invention, which selectively 
binds to an SH3 domain of interest, or blocks or inhibits protein-protein, protein- 
peptide, protein-nucleotide, protein-polynucleotide, protein-lipid, protein- 

1 5 carbohydrate or protein-small molecule interactions mediated by an SH3 domain of 

interest, is referred to as an SH3 binding agent or SH3 blocking agent. Substances to 
be assessed for their ability to bind selectively to an SH3 domain of interest can be 
obtained from a variety of sources, including for example microbial broths, cellular 
extracts, conditioned media from cells, synthetic compounds and combinatorial 

2 0 libraries. Once an agent has been identified as an SH3 binding or blocking agent, it 

can be produced using known methods, such as by recombinant methods of protein 
production or chemical synthesis. It can also be obtained from the source in which it 
was initially identified {e.g., a microbial broth, cellular extract, etc. as above). Specific 
SH3 ligands identified by screening bacteriophage display libraries are themselves 

2 5 useful in certain applications as SH3 binding or blocking agents. Binding or blocking 

agents for possible therapeutic applications can be further evaluated by conventional 
methods with respect to toxicological and pharmacological activity. 

As mentioned above, pharmaceutical agents which interfere with the forma- 

3 0 tion or stability of signaling complexes (such as those formed by signaling proteins 

containing one or more SH3 domains with their natural ligands) can be used to treat 
or prevent the diseases or their pathological effects mediated by such complexes— but 
only if these interfering agents have sufficient binding selectivity. Unfortunately, until 
the work described herein, very little was known about the natural ligands or the 
3 5 binding specificities of the various SH3 domains to determine whether selective 

interference with SH3 ligand binding would be possible, and if it were, to permit the 
identification or design of agents capable of so acting. Indeed, the limited apparent 
sequence requirements (proline rich sequences) for binding to SH3 domains, together 
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with the limited expected affinities of interactions therewith were considered 
discouraging, especially in comparison to emerging information for SH2 domains. 

Despite such considerations, the present inventors have now developed a 
method for identifying ligands, referred to as specific SH3 ligands (SSLs), which bind 
5 selectively and specifically to SH3 domains of various proteins with high affinity. 
These ligands are useful in the subject method for identifying agents which selectively 
and specifically block or inhibit protein-protein interactions via a preselected SH3 
domain. Such agents may be useful in the therapy of diseases and conditions 
mediated by such protein-protein interactions. The specific SH3 ligands described 
1 0 herein, such as those presented in Figures 1 to 8, are also SH3 binding agents. 

The following is a description of our work, including the discovery of the 
binding specificity of SH3 domains, and applications of these important findings. 

Identification and characterization of SSLs 

1 5 SSLs are preferably selected from combinatorial arrays (biased or unbiased) 

of candidates as can be provided by degenerate peptide libraries which can be 
readily prepared in solution, in immobilized form or as phage display libraries. 
Combinatorial libraries can also be synthesized of peptides containing one or more D- 
amino acids and /or non-naturally occuring amino acids. Libraries can also be 

2 0 synthesized of peptoids and non-peptide synthetic moieties. Such approaches 

provide access to candidate SSLs containing novel peptide sequences, peptide 
sequences not found in nature and novel non-peptide strucutres. 

SSLs are selected from the candidate molecules using the SH3 domain of 
interest, typically in the form of the protein containing the SH3 domain, a fragment of 

2 5 such protein, a peptide or fusion protein containing the amino acid sequence of the 

SH3 domain of interest. Additional guidance in this regard is provided below. 

Selection may be effected by combining one or more candidate SSLs with the 
SH3 domain of interest, incubating the resultant mixture under conditions permitting 
the formation of an SH3 domain-ligand complex, separating the complexes so formed 

3 0 from uncomplexed candidates and identifying the candidates which bound to the 

SH3 domain. 

We have found phage display to be particularly effective in the identification 
of peptide SSLs. Interestingly, SSLs so identified usually comprise novel peptide 
sequences not represented by known peptide or DNA sequences. We have developed 
3 5 this approach for identifying SSLs of varying lengths ranging from minimal core 

sequences to longer sequences which also contain one or more, and often preferably 
several more, flanking amino acid residues, as discussed in further detail below. See 
also Rickles et al, 1994, EMBO Journal 13(23):5598-5604, the full contents of which 
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are incorporated herein by reference. By the means disclosed herein, SSLs may be 
obtained with measurable Kd values, with respect to an SH3 domain of interest, in 
the range of about 7 - 30^iM, more preferably in the range of about 2-8 ^iM, and even 
more preferably in the range of about 0.22 \jM to about 2 (iM. In some cases SSLs 
5 with even lower Kd values for an SH3 of interest, which may be obtained as 

described herein, will be preferred. Kd values may be readily measured, for example, 
using BIAcore techniques, as disclosed herein, or by flourescence spectroscopy (see 
e.g. Chen et al, J Am Chem Soc 115, 12591-12592 (1993)). 

1 0 Preparation ofSH3-like Domain for Selection of SSLs or Compound Screening 

In one embodiment, one prepares a phage library (using e.g. ml 3, fd, or 
lambda phage), displaying inserts of from 4 to about 80 amino acid residues using 
conventional procedures. The inserts may represent a completely degenerate or biased 
array. By selection against the SH3 domain of interest one selects phage bearing 

1 5 inserts which bind to the SH3. This process can be repeated through several cycles, 

typically 2 to about ten, of reselection of phage that bound. Repeated rounds lead to 
enrichment of phage bearing particular sequences. DNA sequence analysis is 
conducted to identify the sequence(s) of the expressed peptides. By analysis of the 
peptide sequence of the selected phage a preferred or consensus SH3 ligand sequence 

2 0 can be identified. The minimal linear portion of that sequence that binds to the SH3 

domain of interest is termed the "core". This can be determined using synthetic 
peptides, with phage display or other techniques. It will often be preferred to start 
with phage inserts of seven to ten residues and identify a consensus selected 
sequence. One can repeat this procedure using a biased library containing inserts 

2 5 containing part or all of the consensus sequence plus one or more additional 

degenerate residues upstream or downstream thereof. Sequential rounds of such 
selections permit identification of SSLs containing flanking sequence. (Identification 
of core and flanking sequences may also be conducted by analgous conventional 
procedures with synthetic peptide libraries in place of phage libraries. In that case, 

3 0 the design of the peptide library is analogous to the design of the phage inserts.) 

SH3 expression vectors can be constructed by ligating into a conventional 
expression vector the DNA sequence encoding the consensus homology domain for 
the SH3 domain of interest, alone or preferably with additional flanking sequence. For 
example, for human Src the SH3 consensus homology domain includes amino acids 
3 5 91-140. We have prepared SH3 domain protein from E. coli expression vectors using 
amino acids 84-145, which contain an additional 7 amino acids on the N-terminal 
side of the homology domain and 5 amino acids on the C-terminal side. These 
additional amino acids may provide stability to the SH3-like domain, may improve 
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its production in the expression cells, may improve its ability to interact with ligands 
or other proteins or be necessary for linking to a fusion protein for reasons discussed 
below. 

The SH3 domain may be expressed within all or part of its natural context, as 
5 an isolated SH3-like domain, in a tandem array of SH3-like domains containing two 
or more of the same or different domain, or as a fusion protein with other unrelated 
domains including but not limited to SH2-like domains, protein kinase domains 
glutathione S-transferase (GST), epitope tags, kinase recognition sequences, maltose 
binding protein, signal sequences, biotin-modification sequences. 
1 0 The SH3-like domains may be modified to: 

• facilitate purification e.g. by expression as a fusion to glutathione-S-transferase, 
maltose binding protein, metal-chelation sequences (poly-histidine), protein A or 
others; 

• facilitate identification or quantitation, e.g. by covalent modification using biotin, 

1 5 fluorophores, chromophores, scintillons, spin labels, radioactive or non- 

radioactive isotope tags, magnetic particles, metal coloids, etc; 

• generate a measurable signal, e.g. by expression as a fusion to alkaline 
phosphatase, horse radish peroxidase, green protein, luciferase, etc; 

• adhere to defined solid supports, e.g. by expression as a fusion to an epitope tag 

2 0 or other antigenic domain; engineered to provide unique or uniquely accessible 

protein features e.g. N-terminal serine, cysteine, lysine or others, etc. 

• remove undesirable features that do not effect SH3 function but pose 
experimental complications, e.g. by mutation of cysteines that participate in 
unnatural domain dimerization 

2 5 • improve stability under conditions of binding assays (e.g. by altering the natural 

coding sequence to encode cysteines that form stabilizing disulfides). 

Selection ofSSLs 

Initially, a phage library containing displayed peptides consisting of six 

3 0 randomized residues ("X" represents any amino acid) bracketed by glycines (- 

GXXXXXXG-) (SEQ ID NO: 157), was screened using the Src SH3 domain, prepared 
as part of a GST fusion. Phage which bound the GST-SH3 fusion were isolated and 
sequenced. Enrichment was observed when using GST-Src SH3 but not when using 
GST. The sequencing of ten Src SH3 domain-selected phage showed that each phage 
3 5 insert had the same nucleotide sequence, which encoded a proline-rich insert. 

Furthermore, the Src SH3 selected peptide was seven residues in length, not six, due 
to a point mutation that converted a glycine residue to arginine: 
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(SEQ ID NO: 157) GXXXXXXG (starting library) 

(SEQ ID NO: 158) RSLPPIPG (Src SH3-selected sequence) 

This fortuitous mutation probably occurred during phage propagation. That this was 
5 the only peptide selected when using the src SH3 domain suggests that a Src SH3 
ligand should be, at a minimum, seven residues in length, proline-rich and contain an 
arginine residue as part of the recognition sequence. 

A second and biased library was prepared, in which the phage displayed 
peptides containing six random amino acid residues flanking the tetrapeptide 
1 0 proline-rich sequence identified in the first screen (-X6PPIPG-). This new library, 
designated the X 6 PPIP library (SEQ ID NO: 3), was screened using GST- Src SH3. 
The amino acid sequences of peptides identified in this manner are shown in figure 1. 
These results support the view that the Src SH3 recognition sequence is proline-rich 
and at least seven amino acid residues in length. Of the Src SH3-selected sequences, 

1 5 while no amino acid preferences were observed at positions 1, 2 and 3 of the six 

randomized residues, arginine was preferred at position 4, proline at position 5, and 
leucine at position 6. That arginine is preferred at the fourth position supports our 
view that the point mutation observed in the isolate from the first library (changing 
glycine to arginine) was critical for Src SH3 selection. 

2 0 The X6PPIPG library was also screened using Fyn, Lyn, Yes, PI3K, Tsk, Abl 

GST-SH3 fusion proteins and with GRB2 (a 25 kD protein containing two SH3 
domains). Each of these proteins enriched phage from the X6PPIPG library. The 
results from sequencing selected phage are shown in Figure 1. Like the Src SH3 
domain, the Fyn, Lyn, Yes and PI3K SH3 domains and GRB2 prefer ligands that 

2 5 contain arginine at position 4, proline at position 5 and leucine at position 6. 

However, the Lyn, Yes, PI3K and GRB2 SH3 domains have additional binding 
preferences. 

In contrast to the ligands identified using the Src, Fyn, Lyn, Yes, PI3K and 
GRB2 SH3 domains, the Tsk and Abl SH3 domains selected ligands that were quite 

3 0 disparate. The Abl SH3 domain, for example, showed preferences for peptides 

containing prolines at the first three positions of the six randomized residues. 
Furthermore, tyrosine and proline were selected at the 4th and 6th position 
respectively. 

To investigate the sequence preferences in the "PPIP" region of the SH3 
3 5 ligands, three new biased libraries were generated. The library RSLRPLX6 was 
prepared to identify additional Src, Fyn, Lyn and PI3K SH3 ligands. The library 
PPPYPPX6 was prepared and screened to further define Abl SH3 binding preferences 
and the library RLYRPLX6 was used to evaluate GRB2 ligand preferences. 
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Bacteriophage were selected from these new libraries and sequenced. As 
shown in Figure 2, the proline motif found originally by screening the GX^G library 
using the Src SH3 domain was again selected, this time by all SH3 domains whose 
binding preferences were analyzed. The C-terminal consensus sequences derived from 
5 the RSLPRLX6 library are PPLPXP (Src), PP(I/L)PXX (Fyn), PPLPXP (Lyn), PPLPPP 
(PI3K). When using the PPPYPPX6 library, the Abl SH3 domain prefers peptides that 
contain the C-terminal residues PP(I/V)PXX. GRB2 selects SH3 ligands that contain 
the residues PPLPXX. The minimum number of selected consensus residues that 
suffices for recognition by an SH3 domain we here term the core recognition sequence. 
1 0 The Src SH3 domain, for example recognizes and prefers the sequence RPLPPLPXP. 
In contrast, the Abl SH3 domain prefers the core consensus sequence 
PPPYPPPP(I/V)P. 

To rapidly evaluate the binding properties of the phage-displayed ligands, 
and to gain insight into SH3 domain selectivity, the relative binding affinity of phage 

1 5 isolates was measured by a novel application of surface plasmon resonance. The 

binding of phage-displayed ligands to immobilized GST-SH3 proteins was analyzed 
using a BIAcore Biosensor®- To obtain preliminary evidence whether the Abl and Src 
selected phage showed selective binding for each of these SH3 domains, relative 
binding affinity was measured. As shown in Figure 3, binding of phage containing the 

2 0 Src SH3 selected consensus sequence was readily detected using GST-Src SH3 but not 

using GST-Abl SH3. Conversely, phage displaying the Abl SH3-selected sequence 
PPPYPPPPIP bound to GST-Abl SH3 but binding to GST-Src SH3 was not observed. 

Peptides corresponding to phage displayed sequences may be synthesized 
and used to more accurately quantify their binding to SH3 domains, e.g. using surface 

2 5 plasmon resonance techniques. For example, using the "Src" and "Abl" peptides, 

binding was studied in vitro by surface plasmon resonance. Again, Src and Abl SH3 
binding was selective. The Abl SH3 domain bound with high affinity to the Abl and 
not the Src-selected peptide. Conversely, the human and avian Src SH3 domains 
recognized the Src but not the Abl-selected sequence. The results are summarized in 

3 0 Table 1. 
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Table 1 



Abl 

(X ± SEM) 



hSrc 

(X ± SEM) 



aSrc 

(X ± SEM) 



aePPPYPPPIPggk 
aeRSSRPLPPIPggk 
kggAPTMPPPLPPVPPg 



2.0 ±0.2 NDB NDB 
NDB 19.5 ± 1.1 58.5 ± 10 



66.3 ± 19.3 NDB 



NDB 



Table 1. Binding affinities ofSrc and Abl SH3 domains for peptides. Binding of human 
Src (hSrc), avian Src (aSrc) and abl (abl) SH3 domains to Sroselected 
(aeRSSRPLPPIPggk), abl-selected (aePPPYPPPPIPggk) and 3BP1 
(kggAPTMPPPLPPVPPg) sequences. values are umolar and represent the Mean 
± SEM (n=3). NDB: no detectable binding at a protein concentration of 50 ^iM. 



We have exploited these methods using biased phage libraries to more fully 
define SH3 domain ligand binding preferences. We have discovered that SH3 
domains that prefer almost identical core sequences may still have quite distinct 
5 binding preferences. Additional biased libraries were prepared which displayed the 
core sequence RPLPPLPPP adjacent to five random upstream residues 
(X5RPLPPLPPP). This library was screened using the Src, Fyn, Lyn, Yes and PI3K 
SH3 domains. As shown in Figure 4, selected ligands had preferences at positions 
upstream of the core sequence. Some observed preferences are shared by subclasses 
1 0 of SH3 domains while other preferences are distinct for a particular SH3 domain. 
Additional sequence preferences were observed when a library was screened which 
contained five random residues downstream of core sequences (Figure 5). 

We have analyzed the effect that flanking sequences can have on 
binding affinity. As shown in Figure 6, phage displaying a core sequence binds with 

1 5 similar affinity to the Src and Lyn SH3 domains as determined by BIAcore® analysis. 

In contrast, phage displaying an SH3 ligand that consists of the core sequence and Src 
SH3-selected upstream flanking sequences bind with higher affinity to the Src SH3 
domain than to the Lyn SH3 domain. Conversely, phage displaying a ligand 
consisting of the core sequence and Lyn SH3-selected upstream flanking sequences 

2 0 bind with higher affinity to the Lyn SH3 domain than to the Src SH3 domain. Binding 

analysis, using peptides instead of phage reveals that a ligand consisting of the Src- 
selected upstream flanking sequence and the core sequence (VSLA RRPLPPLPPP. the 
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core recognition sequence is underlined) binds with 33-fold higher affinity (Kd= 0.8 
um) to the Src SH3 domain than peptides that contain only the core sequence 
(GG5GGRPLFPLPFP or RPLPPLPP. Kd's = 27um). The results just described 
demonstrate that sequences which flank the core recognition sequence are important 
determinants of both binding affinity and binding selectivity. 

The solution structures of Src SH3 domain-ligand complexes revealed that the 
Src SH3 domain can bind the Src SH3 preferred core ligand sequence (RPLPPLP) 
when it is in the opposite orientation (Feng et al (1994), Science 266, 1241-1247). 
With class II ligands, the conserved core arginine residue is at the COOH-terminus, 
not the amino-terminus (RPLPPLP) as is found for what is now termed the class I Src 
SH3 ligand core sequence. We designed a phage library that displayed a class II core 
and five random downstream residues ( G A APPLPPRX 5 , the conserved arginine 
residue is in bold type, the core recognition sequence is underlined) to determine 
whether there might be preferred flanking sequences for class II ligands. The library 
was screened using the Src, Fyn, Lyn, Yes and PI3K SH3 domains. The results from 
sequencing selected phage is shown in Figure 7. As with class I ligands, there are SH3- 
specific sequence preferences adjacent to the class II ligand core. Furthermore, we find 
that for each SH3 domain analyzed, the sequence preferences adjacent to class I and 
class II cores are distinct. For example, the Src SH3 domain selects the residues 
XSLXX adjacent to the arginine of the class I core (XSLXX RRPLPPLPPP. the class I 
core sequence is underlined, the critical arginine of the class I core is in bold type). In 
contrast, Src SH3 ligands identified using class II libraries have the consensus 
sequence NRPRL adjacent to the class II arginine residue (GAAPPLPPRNRPRL). 
Binding analysis, using peptides, reveals that a ligand consisting of a class II core 
and Src-selected downstream flanking sequences (GAAPPLPPRNRPRL, the core 
recognition sequence is underlined) binds with fifty-fold higher affinity (Kd= 0.22 p,m) 
to the Src SH3 domain than peptides that contain only the core sequence 
(GAAPPLPPR, Kd = 11.4jim). 

The Src SH3 selected flanking sequences are important determinants of Src 
SH3 domain recognition. To determine how specific these residues are for SH3 
domain recognition, binding assays were performed using the Lyn, PI3K, Fyn and Yes 
SH3 domains As shown in Table 2, a peptide containing class 1 core residues and 
the Src SH3 selected flanking sequence (VSLAR) binds at least 50 fold better to the 
Fyn and Yes SH3 domains compared to a peptide containing just the core recognition 
sequence. In contrast, the Src SH3 selected flanking sequences only marginally affect 
(2x) the affinity of the interaction of the core ligand residues with the PI3K SH3 
domain and have no effect on binding to the Lyn SH3 domain. 
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Table 2 
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Class 1 SH3 Ligands 






Core : 

Core+Src SH3 selected flanking sequence 


RPLPPLPGGK 
: VSLARRPLPPLPGGK 


1 0 


Binding 


assay (Kd: (IM) 




SH3 Domain Core Deotide 


Core+Src flank 


1 5 


Src 27.0 


0 . 8 




Lyn 5 . 4 


4 . 0 




PI3K 3 0.8 


15 . 9 


20 


Fyn 2 9.1 


0.7 




Yes 62.0 


0 . 7 


2 5 








Class IX SH3 Ligands 




30 


Core : 

Core+Src SH3 selected flanking sequence 


KGGGAAPPLPPR 
: KGGGAAPPLPPRNRPRL 




Binding 


assav (Kd: uM} 

******** j \ • i 


O O 


SH3 Domain Core peptide 


Core+Src flank 




Src 11.4 


0 .22 


40 


Lyn 2 7.6 


6 . 2 




P13K n.d. 


n.d. 




Fyn 9 9.5 


1.7 


45 

c rv 


Yes >99.5 
(n.d.= not determined) 


3 . 6 



Table 2. Binding affinities of the Src, Fyn, Lyn, Yes and 
PI3K SH3 domains to class 1 and class 2 core peptides and 
core peptides containing Src SH3 selected flanking 
sequences . Kd values were determined by Tryptophan 
5 5 Fluorescence measurements as described (Chen et al (1994) 
J. Am. Chem. Soc . 115, 12591-12592) 
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Similar results to those described above were obtained when comparing a 
class 2 core peptide with one that also has the C-terminal residues (NRPRL), 
identified using the Src SH3 domain. As observed with the Src SH3 domain, the C- 
terminal flanking sequences dramatically increase the affinity of core ligand binding to 
5 the Fyn and Yes SH3 domains. In contrast, binding to the Lyn SH3 domain was only 
marginally (about 4 fold) increased. 

Binding was also analyzed using the Src SH3 domain and class 2 peptides 
that contained PI3K, Lyn or Fyn selected flanking sequences. As discussed above, 
the Src SH3 domain binds to a class 2 core (KGGGAAPPLPPR, the core recognition 
1 0 sequence is underlined) with a Kd = 11.4 pM. The addition of the Src SH3 selected 
C-terminal residues NRPRL increases the affinity about 50 fold (Kd = 0.22 |iiM). A 
peptide which consists of a class 2 core and Fyn selected sequences (NPRPL) binds 
to the Src SH3 domain with a Kd of 1.8 ^iM (6 fold increase in affinity). In contrast, 
class 2 core peptides containing Lyn (PSWMS) or PI3K (PPRPA) selected C- 

1 5 terminal residues bind with affinities similar to that observed with a class 2 core 

peptide (7.4 \iM and 23 |LiM respectively). 

Using the methods described above and elsewhere herein, one may thus 
identify peptides (and non-peptide agents) which selectively bind to an SH3 domain- 

2 0 containing protein of interest. SH3 core ligand sequences may be distinct (for 

example, see Figure 3; Src SH3 and Abl SH3-selected core sequences). In addition, 
residues that flank the core sequence can be distinct (for example, see Figures 4, 5 and 
7). These flanking residues are important for selective recognition by SH3 domains. 
Such SSLs may then be used to identify agents which selectively block or interfere 

2 5 with the binding of a selected SH3 domain containing protein to its ligands without 

substantially interfering with the normal binding of other SH3 domain containing 
proteins, whether or not within the same family. 

Once an SH3 binding agent has been identified based on its ability to prevent 
binding of the specific SH3 ligand to the SH3 domain of interest, it can be further 

3 0 characterized, after it is separated, if necessary, from the test mixture. For example, 

the affinity with which the SH3 binding agent binds the SH3 domain of interest (or 
other SH3 domains) or otherwise inhibits SH3-mediated interactions, its composition 
or chemical nature and its biological/pharmacological activity can be assessed, using 
known methods. Those SH3 binding agents which do not bind to or interefere with 
3 5 other SH3 domains, or which do so with very low affinity, are of particular interest, 
since they will provide a basis for pharmaceutical agents useful as highly specific 
inhibitors of an SH3 domain-containing protein of interest which do not inhibit other 
SH3 domain-containing proteins. 
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In vitro assays 

SH3 binding or blocking agents may be identified based on their ability to inhibit 
formation of an SH3/ligand complex or to bind to SH3 domains using BIAcore® and 
allied technologies (See Example 5) or based on their ability to inhibit formation of an 
SH3/ligand complex using solid phase binding assays. 

SH3 ligand domains 

SH3 ligand domains for use in in vitro assay will generally contain at least about 
6-10 amino acid residues, and in some cases up to about 100 amino acid residues or 
more, although generally they will contain between 6 and about 40 amino acid 
residues. Ligand domains may be selected from known ligand sequences for an SH3 
domain of interest, may be identified by the phage display method disclosed in 
Example 1, or otherwise identified from cell broth, natural extracts, synthetic 
libraries, etc. Ligands may also be phage-displayed peptides, peptide "side chains" 
displayed on non-peptide backbones (peptoids), cells or cell fragments, organic 
molecules, etc. or hybrids of the above. The SH3 ligand domains may be modified to: 

• facilitate purification e.g. expression as a fusion to glutathione-S- 
transferase, maltose binding protein, metal-chelation sequences (poly- 
histidine), protein A or others; 

• facilitate identification or quantitation, e.g. covalent modification using 
biotin, fluorophores, chromophores, scintillons, spin labels, radioactive or 
non-radioactive isotope tags, magnetic particles, metal coloids, etc; 

• generate a measureable signal, e.g. expression as a fusion to alkaline 
phosphatase, peroxidase, green protein, luciferase, etc; 

• adhere to defined solid supports, e.g. fusion to an epitope tag or other 
antigenic domains; engineered to provide unique or uniquely accessible 
features e.g. N-terminal serine, cysteine, lysine or others, etc. 

Alternatively, SH3 ligand domains may be synthesized on solid supports (synthetic 
resins; glass, silica, metal; magnetic beads; others) and assayed without detachment. 

Labelling with fluorochromes, reactive groups or biotin can be accomplished 
using known methods (for example see Chemistry of Protein Conjugation and Cross- 
Linking, 340 pp, S. S. Wong, CRC Press, Inc., Boca Raton, FL, 1991). Examples of 
suitable fluorochomes include fluorescein and derivatives, Rhodamine B and 
derivatives, fluorescamine, eosin isothiocyanate, dansyl chloride, 7-amino-4- 
methylcoumarin,N-(4-anilino-l-napthyl)maleimide, 4'6-diamidino-2-phenylinole; 
examples of suitable reactive groups include diethylenetriaminepentaacetic 
anhydride, N-ethylmaleimide, 2-iminothiolane HC1; biotinylation reagents include 
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iodoacetyl-biotin, 14}iotinamido,-4-maleimidomethyl-butane and N- 
hydroxysuccinimidobiotin. 

Preparation of radiolabelled materials can be accomplished using, for example, 
kinase catalysed 32P phosphorylation (see e.g. Protein Phosphorylation, Part A. 
5 (1991), T. Hunter and B. M. Sefton, eds., Methods in Enzymology Vol. 200, 
Academic Press, Inc, New York, NY, 763 pp.), chloramine-T (Greenwood et al, 
Biochem. J. 89:114, 1963), lactoperoxidase (Marchalonis et al, Biochem. J. 124:921, 
1971), Bolton-Hunter reagent (Bolton and Hunter, Biochem. J. 133:529, 1973) or Iodo- 
beads mediated 1251-iodination (Markwell Anal. Biochem. 125:427, 1982) or tritium 
1 0 incorporation by reductive methylation (Tack et al., J. Biol. Chem. 255:8842, 1980). 

In vivo assays 

Among other features, this invention provides genetically engineered cells, 
preferably eucaryotic cells, useful for high throughput identification of compounds 

1 5 which interfere with, inhibit or otherwise impede interactions mediated by an SH3 

domain. The cells of this invention contain and are capable of expressing a reporter 
gene and genes encoding two fusion proteins. The two fusion proteins are capable of 
binding to each other and when they do so, of detectably activating the transcription 
of the reporter gene. 

2 0 The first fusion protein is a DNA-binding fusion protein which contains a first 

domain capable of binding to a regulatory element linked to the reporter gene and a 
second domain which comprises one member of an SH3 binding pair. 

The second fusion protein is a transcriptional activating fusion protein which 
contains a first domain capable of activating the transcription of a gene with which it 

2 5 becomes associated and a second domain which comprises the other member of an 

SH3 binding pair. 

An SH3 binding pair consists of a first peptide sequence comprising an SH3 
domain of interest and a second peptide sequence comprising a specific ligand for 
that SH3 domain of interest. Viewed from this perspective, one of the fusion proteins 

3 0 contains an SH3 domain, the other contains a ligand for that SH3 domain. The two 

fusion proteins contain additional domains capable, upon complexation of the fusion 
proteins, of activating transcription of a reporter gene under the expression control of 
a DNA element responsive to those additional domains. 

One or both of the fusion proteins may also contain one or more optional 
3 5 domains or elements, including a domain capable of regulatably sequestering the 
fusion protein at a desired cellular location or compartment. 

In the practice of this invention, cells containing the components mentioned 
above are cultured under suitable conditions and in a suitable culture medium 
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permitting cell growth and a detectable level of expression of the reporter gene. A test 
substance is added to the medium, the cells are cultured for an appropriate 
incubation period and the level of expression of the reporter gene is measured. A 
decrease in the level of expression of the reporter gene indicates that the test 
substance is a candidate inhibitor of the binding of the SH3 binding pair. 

The assay may be repeated using a control cell containing and capable of 
expressing the reporter gene and its associated control elements and a gene encoding 
a control fusion protein. The control fusion protein contains both the DNA binding 
domain and the transcriptional activating domain which are separately present in the 
respective DNA-binding and transcriptional activating fusion proteins, previously 
described. Alternatively, two fusion proteins may be used, each containing one half of 
a binding partner with different binding specificities then those found for SH3 
domains and the protein-protein interaction assayed as described above. The failure 
of the test substance to significantly alter the level of expression of the reporter gene 
confirms its activity as an SH3 binding inhibitor. 

in vivo Components & their use 

1 Host cells 

Any cells, preferably eucaryotic cells, which can be transformed with 
heterologous DNA and which can be grown or maintained in culture may be used in 
the practice of this invention, including yeast and mammalian cells. Mammalian cells 
include those of mouse, hamster, rat, rabbit, dog, cow or primate, including human, 
origin. They may be of a wide variety of tissue types, including mast cells, fibroblasts, 
osteoclasts, osteoblasts, T cells, macrophages and neutrophils, and may be primary 
cells or cell lines.In one embodiment, cell-free transcription systems prepared from 
cells as described above or prepared from purified components may be used in lieu of 
cells. 

As discussed below, the cells are engineered by the introduction of heterologous 
DNA(s) encoding (a) the DNA-binding fusion protein, (b) the transcriptional 
activating fusion protein and (c) a reporter gene under the transcriptional control of a 
promoter responsive to the complex of both fusion proteins. The DNA molecule 
encoding those components may be introduced into the cells as separate DNAs or 
linked together. 

2 Reporter Gene Constructs 

Suitable reporter genes include any gene whose expression can be detected. Such 
genes include those encoding an enzyme which catalyzes a reaction resulting in a 
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detectable color change, an enzyme that permits continued growth in the presence of 
an otherwise toxic agent, a repressor or suppressor of a biological function which is 
activated or enhanced by the reduced expression of the " reporter" gene. Examples of 
reporter genes which may be used in the practice of this invention include genes 
5 encoding enzymes such as luciferase, beta-galactosidase, alkaline phosphatase or 
horseradish peroxidase which catalyze reactions generating readily detectable 
products. Alternatively, the reporter gene may encode a readily detectable cell surface 
marker such as Thyl. One well known reporter gene is the gene encoding secreted 
alkaline phosphatase (SEAP). The production of SEAP may be monitored from the 
1 0 culture media without the need to lyse the cells. This permits convenient detection, 

using a colorometric or fluorescent assay, at different time points after the addition of 
a test compound to the cell culture. 

The reporter gene will be linked to a promoter which is responsive to a 
transcriptional activater capable of binding to the promoter and activating 

1 5 transcription therefrom. The promoter may be a synthetic promoter consisting of 

multiple binding sites, upstream of a TATA box element, for the DNA binding 
domain of a transcription factor. The number of such binding sites can be adjusted to 
optimize the overall level of reporter gene expression induced by the fusion proteins 
of this invention. Examples of such binding sites include GAL4 binding sites (see e.g. 

2 0 Acheson, 1980, DNA Tumor Viruses, (Cold Spring Harbor, NY, Tooze, J. ed.) pp 

151-160; Peden et al, 1980, Science 209:1392-1396; Liu & Green, 1990, Cell 61:1217- 
1224; and Ptashne & Gann, 1990, Nature 346:329-331) for embodiments using GAL4 
as the DNA-binding domain of the DNA-binding fusion protein and VP16 as the 
transcriptional activating domain of the cognate transcriptional activating protein. 

2 5 Additional guidance in the use of DNA-binding and transcriptional activating 

domains and promoters responsive thereto may be obtained from Fields et al, US 
Patent No 5,283,173 (1 Feb 1994) and Vasavada et al, 1991, Proc Natl Acad Sci 
USA 88:10686-10690. 

3 0 3 Fusion protein Constructs 

The two fusion proteins of this invention will typically contain either (a) one 
or more DNA binding domains or (b) one or more transcriptional activating domains, 
in addition to one or more SH3 domains or ligands thereto. One embodiment of the 
invention involves a pair of fusion proteins, the first containing one or more DNA 
3 5 binding domains and one or more SH3 domains, the second containing one or more 
transcriptional activation domains and one or more SH3 ligand domains. Another 
embodiment involves a pair of fusion proteins, wherein the first contains one or more 
DNA binding domains and one or more SH3 ligand domains, the second containing 
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one or more transcriptional activation domains and one or more SH3 domains. These 
are depicted schematically in Figures 9-11. The DNA binding and transcriptional 
activation domains may be derived from the same or different transcription factors. 
Fusion proteins of this invention may also contain a targeting sequence 
5 providing for translocation of the protein to the nucleus. For example, such a targeting 
sequence may have a plurality of basic amino acids, referred to as a bipartite basic 
repeat (reviewed in Garcia-Bustos et al, Biochimica et Biophysica Acta (1991) 1071, 
83-101). This sequence can appear in any portion of the molecule internal or proximal 
to the N- or C-terminus and results in the fusion protein being inside the nucleus. 
1 0 The transcription components can be endogenous or exogenous to the cellular 

host. If the transcription factors are exogenous, but functional within the host and can 
cooperate with the endogenous RNA polymerase (rather than requiring an exogenous 
RNA polymerase, for which a gene could be introduced), then an exogenous promoter 
element functional with the fused transcription factors can be provided with a second 

1 5 construct for regulating transcription of the target gene. By this means the initiation of 

transcription can be restricted to the gene(s) associated with the exogenous promoter 
region, i.e., the target gene(s). 

A large number of transcription factors are known which require two subunits 
for activity. Alternatively, in cases where a single transcription factor can be divided 

2 0 into two separate functional domains (e.g. a transcriptional activator domain and a 

DNA-binding domain), so that each domain is inactive by itself, but when brought 
together in close proximity, transcriptional activity is restored. Transcription factors 
which can be used include yeast GAL4, which can be divided into two domains as 
described by Ma and Ptashne (Cell (1987) 48, 847-853) and Fields and Song, supra. 

2 5 One may use, for instance, a fusion of GAL4(1-147)-SNF1 and SNF4-GAL4(768- 

881), where the SNF1 and -4 may be replaced by the subject binding proteins as 
binding domains. Combinations of GAL4 and VP16 or HNF-1 and VP16 can be 
employed. Each of these proteins will have a linked partner, part of an interacting 
pair. The interaction (binding partner recognition) will bring the transcriptional 

3 0 activation domain (VP16) to DNA (GAL4 or HNF-1 ) and transcription will be 

induced (Figure 11). Other transcription factors that could be used for these studies 
include , but are not restricted to, the members of the Jun, Fos, and ATF/CREB 
families, Octl, Spl, HNF-3, the steriod receptor superfamily, and the like. 

3 5 DNA binding domains 

Preferred DNA-binding domains for use in this invention meet the following 
criteria. First, they should be small, stably folded domains. Second, there should be 
background information available on the nature of the interaction of the domain with 



WO 95/24419 3 0 PCT7US95/03208 

DNA and with other proteins. Third, the preferred domains should bind DNA as 
monomers, although domains that bind as dimers could be accomodated. Fourth, 
they should bind DNA with relatively rapid kinetics, particularly with regard to the 
association rate. Examples of DNA-binding domains that meet some or all of these 
5 criteria are domains of the homeodomain class, the zinc-finger class, and the paired- 
box class, for which numerous examples are known in the literature. 

In cases where detailed information on the molecular contacts of the protein 
with DNA is available, the DNA recognition specificity of the protein may be 
engineered by amino acid substitutions. 
1 0 A preferred feature for DNA binding domains is the lack of interactions with 

other cellular proteins. Alternatively, the nature of such interactions should be known 
with precision, so that these interactions could be abrogated by suitable amino acid 
substitutions within the DNA-binding domain. In addition, the DNA binding domain 
may be unique for the cell type utilized. The DNA binding element, when introduced 

1 5 into cells, will preferably not be occupied in the absence of exogenously added DNA 

binding domain. The extent of occupancy of the DNA binding element may be 
determined by measuring the level of gene transcription when the DNA binding 
domain is part of a reporter gene's regulatory elements, in the absence and presence of 
exogenously added cognate transcription factor. 

2 0 Cell-type specific factors (not found in the cells used for the in vivo assay), 

DNA binding domains with altered binding specificity and procaryotic DNA binding 
domains are well suited for the methods described. One suitable DNA-binding 
domain is derived from the Phoxl protein (Grueneberg, D. A., Natesan, S., 
Alexandre, C and Gilman, M. Z. (1992). Human and Drosophila homeodomain 

2 5 protein the enhance the DNA-binding activity of serum response factor. Science 257, 

1089-1095; Genbank accession number: M95929). This protein is a member of the 
homeodomain class. A 69 amino-acid domain derived from Phoxl is sufficient to 
bind DNA. High-affinity DNA recognition sequences for the protein have also been 
identified, as have amino acid substitutions that change its DNA recognition 

3 0 specificity and that affect its ability to interact with certain endogenous proteins in 

human cells. (Grueneberg et al) 

A second suitable DNA-binding domain is derived from the SRE-ZBP protein 
(Attar, R. M. and Gilman, M. Z. (1992). Expression cloning of a novel zinc-finger 
protein that binds to the c-fos serum response element. Mol.Cell. Biol. 12, 2432-2443; 
3 5 Genbank accession number: M88579). SRE-ZBP is a member of the C2H2 class of 
zinc-finger proteins. It has seven tandem zinc fingers. Any one of these zinc-finger 
domains or any combination of two or more domains can be used to generate a DNA- 
binding domain with novel recognition specificity. Furthermore, because the general 
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structure and mode of DNA recognition is known for proteins of this class, DNA 
recognition can be directly modified if necessary. 

Transcriptional activation domains 
5 In one embodiment, the SH3 domain will be linked to a DNA binding domain 

and therefore tethered to DNA associated with the reporter gene. The SH3 domain 
will have a previously identified peptide ligand binding partner. The SH3 ligand 
(SSL) will be expressed in cells as part of a hybrid protein which also contains a 
transcriptional activation domain. Binding of the SSL-bearing hybrid protein to the 

1 0 SH3-bearing transcriptional activator fusion protein is detected by monitoring 

reporter gene expression. The SH3 -SSL interaction results in the transcriptional 
activation domain being brought to promoter DNA, stimulating reporter gene 
transcription. Such SH3 interaction will result in the increased expression of a gene 
whose production and quantitation can be easily determined. 
15 In general, preferred transcriptional activation domains are compact in size, 

potent in activity, and non-toxic to cells. Additional features become relevant for 
particular applications. For example, for constructs to be used in one particular type 
of cell or tissue, activation domains with especially high activity in the target cell are 
used. In applications that involve the stable integration of the target gene into host 

2 0 cell DNA, activation domains that resist chromatin suppression are used. 

SH3 domains 

SH3 domains for use in the fusion proteins comprise all or part of an SH3- 
containing protein, including at least the SH3 domain of interest or ligand binding 

2 5 region thereof. Typically, the SH3 domain of our fusion proteins will be at least about 

40 ammo acids in length, and may contain up to all or substantially all of the amino 
acid sequence of the SH3-bearing protein or a portion sufficient to satisfy the 
functional criteria of an SH3 domain as described herein. The extents of the domain 
may be defined by homology to other SH3 domains, reference to previously 

3 0 characterized SH3 domains or function. In one embodiment, the inclusion of domains 

other than those generally regarded as necessary for SH3 function may be included if 
the additional domain(s) is hypothesized to allosterically or otherwise influence the 
behavior of the SH3 domain. In other embodiments, inclusion of protein domains that 
are not generally regarded as influencing, either allosterically or otherwise, the 
3 5 behaviour of the SH3 domain may be included to stabilize the SH3 fold, enhance 

expression or provide a means of identifying or manipulating the protein {e.g. fusion 
to glutathione-S-transferase or other domains that aid in identification/purification, 
epitope tags, etc). 
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SH3 ligand domains 

SH3 ligand domains for use in the fusion proteins will generally be as described 
above. In a cell free implementation, SH3 ligands may be synthetic or non-synthetic 
5 peptides, peptoids or small molecules identified from libraries, cell broths, natural 
extracts, etc. 

optional domains 

Compounds introduced into cells may be labile and therefore not present for 
1 0 long periods of time once introduced into the cell. It would be advantageous to have 
all components present in the cell at the time the cells are exposed to potential 
inhibitors of SH3 binding partners. We therefore have made constructs that consist of 
a steroid hormone ligand binding domain (HBD) fused to the SH3 ligand /activation 
domain protein. The regulatory properties of the steroid hormone ligand binding 

1 5 domain (Picard et al, Cell (1988) 54, 1073-1080; Eilers et al, Nature (1989) 340, 66- 

68) allows the post-translational interaction of the SH3 binding partner pair. The 
steroid hormone ligand binding domain serves as an autonomous regulatory domain: 
upon fusion to a heterologous protein, the hormone binding domain subjects the 
chimeric protein to hormonal control. This regulatory property is due to the hormone- 

2 0 reversable interactions of the HBD with heat shock proteins. Hence, the DNA binding 

domain/SH3 domain and the SH3 ligand/HBD/transcriptional activation domain 
can be expressed in cells containing the reporter gene. Only with the addition of 
steroid hormone will the SH3 binding partners interact. In the absence of hormone, the 
SH3 ligand/HBD/transcriptional activation domain fusion protein will not be 

2 5 competent to interact with the DNA bound SH3 binding partner. 

The HBD is an important feature of the system described. Inhibitors of SH3- 
dependent protein-protein interactions could exert their effect in two ways; compete 
with the ligand/HBD/activation domain for the binding partner or displace the 
ligand/HBD/activation domain), which has complexed with the binding partner. 

3 0 Mechanistically, it will be easier to block than displace binding partners. Potential 

inhibitors can be introduced into cells prior to complex formation (hormone addition) 
to block complex formation. 

In addition to the SH3 ligand and activation domain and HBD, other domains 
that might be part of this hybrid protein include an antibody recognition (epitope) 
3 5 sequence (for detection using antibodies to confirm that the fusion protein is made 

and that the level of expression is not affected by compounds tested for SH3 binding 
partner inhibition ) and a nuclear localization sequence. 
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4 Assembly of constructs 

DNA sequences encoding the DNA-binding or transcriptional activating 
domain(s) is joined to DNA encoding the SH3 domain(s) or SH3 ligand domain(s), as 
the case may be. These sequences are joined such that they constitute a single open 
reading frame that can be translated in cells into a single polypeptide harboring all 
intended domains. The order and arrangement of the domains within the polypeptide 
can vary. 

5 Introduction into cells & selection of transf ectants. 

These constructs may be introduced into cells by any convenient means. It may 
be most convenient to use simple transfection procedures. The constructs are 
combined onto a single plasmid vector together with a selectable marker gene, or they 
may reside on any number of individual plasmids which could be introduced by co- 
transfection. All three constructs may be combined into a single viral vector, or single 
or multiple plasmids may be administered by direct DNA injection or in conjunction 
with lipid carriers. 

Engineered cells and cell lines may be prepared by introducing into them the 
reporter gene of interest linked to the abovementioned expression elements and to a 
selectable marker, usually a constitutively expressed selectable marker. Introduction 
of DNA at this and other steps of the preparation of the cells may be carried out 
using any convenient procedures, including for example calcium phosphate 
transfection or lipofection. Read through transcription, e.g. from the promoter of the 
constitutively expressed selectable marker, may result in a detectable or even high 
basal level of expression for the reporter gene. [Therefore, cells will be selected that 
exhibit low or undetectable levels of reporter gene expression. An expression vector 
that produces a DNA binding domain /activation domain fusion protein will be 
introduced into the cell lines to select for cells with high levels of induced reporter 
gene expression. 

6 Setting up & running the assays 

The SH3-inhibitor indicator cell line will be cultured in standard tissue culture 
media containing the drugs necessary to select for cells which stabily retain the genes 
described above. Cells will be cultured in standard tissue culture dishes, which 
includes multidishes and microwell plates. Cells will be exposed to hormone and the 
transcription induced due to SH3 binding partner interactions assayed using 
standard colorometric or flourescence assays. 

Test compounds will be added to the culture media to assay the effect on SH3 
binding. Cells at this time may be bathed in tissue culture media (with or without 
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serum) or a balanced salt solution. The compounds to be tested may be cell 
permeable and therefore added directly to cells or it may be necessary to first 
permeable the cells using streptolysin O, tetanolysin or another cell permeabilizing 
agent. Cells will be exposed to hormone (to induce SH3 binding partner interactions). 
5 After transcription is induced, cells or media( if a reporter gene such as SEAP is 
used,see above) will be harvested and an extract prepared to assay reporter gene 
expression. 

7 how to interpret the results 

1 0 The SH3 fusion protein and SSL fusion protein are capable of forming a 

complex with each other. The cells express the reporter gene unless a substance is 
present which binds to the relevant SH3 domain or otherwise blocks the SH3- 
mediated interaction required for transcription of the reporter gene. In this assay, the 
cells are cultured or maintained in a suitable culture medium to establish a base-line 

1 5 for expression of the reporter gene. The test substance is added to the culture medium 

and the ability of the test substance to inhibit expression of the reporter gene is 
measured. If the level of reporter gene expression is reduced in the presence of the test 
substance, the test substance is a putative SH3 blocker with respect to the SH3 
domain involved in transcriptional control. There may be compounds that inhibit 

2 0 reporter gene expression by non-specific mechanisms, for example by having a general 

inhibitory effect on cellular transcription or translation. Because of this possibility, 
compounds will also be assayed for their effect on reporter gene expression when 
reporter gene transcription is not dependent on SH3 binding pair interactions. For this 
purpose, an indicator cell line will be utilized wherein the transcriptional activation 

2 5 domain is brought directly to DNA by the DNA binding domain (by expressing in 

cells a DNA binding domain/transcriptional activation domain fusion). The fusion 
protein will also contain a hormone binding domain (HBD) so that the only difference 
between the two indicator cell lines is the dependence upon SH3 binding pair 
interaction for reporter gene expression. 

30 

8 Pharmaceutical applications 

By virtue of its capacity to inhibit protein-protein interactions required for 
cellular events of pharmacologic importance, SH3 binders and blockers identified 
by the method ("SH3 inhibitor") of this invention may be used in pharmaceutical 

3 5 compositions and methods for treatment or prevention in a mammal in need 

thereof. 

Mammals include rodents such as mice, rats and guinea pigs as well as 
dogs, cats, horses, cattle, sheep, non-human primates and humans. 
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The preferred method of such treatment or prevention is by administering to a 
mammal an effective amount of the SH3 inhibitor to prevent, alleviate or cure said 
disease or disorder. Such effective amounts can be readily determined by evaluating 
the compounds of this invention in conventional assays well-known in the art 
5 including assays described herein. 

Therapeutic/Prophylactic Administration & Pharmaceutical Compositions 

The invention provides methods of treating, preventing and /or alleviating 

the symptoms and /or severity of a disease or disorder referred to above by 
1 0 administration to a subject of an SH3 inhibitor in an amount effective therefor. The 

subject will be an animal, including but not limited to animals such as cows, pigs, 

chickens, etc. f and is preferably a mammal, and most preferably human. 

Various delivery systems are known and can be used to administer the SH3 

inhibitor, e.g., encapsulation in liposomes, microparticles, microcapsules, etc. One 

1 5 mode of delivery of interest is via pulmonary administration, as detailed more fully 

infra. Other methods of introduction include but are not limited to intradermal, 
intramuscular, intraperitoneal, intravenous, subcutaneous, intranasal, epidural and 
oral routes. The SH3 inhibitor may be administered by any convenient route, for 
example by infusion or bolus injection, by absorption through epithelial or 

2 0 mucocutaneous linings {e.g., oral mucosa, rectal and intestinal mucosa, etc.) and 

may be administered together with other biologically active agents. Administration 
can be systemic or local. For treatment or prophylaxis of nasal, bronchial or 
pulmonary conditions, preferred routes of administration are oral, nasal or via a 
bronchial aerosol or nebulizer. 

2 5 In specific embodiments, it may thus be desirable to administer the SH3 

inhibitor locally to the area in need of treatment; this may be achieved by, for 
example, and not by way of limitation, local infusion during surgery, topical 
application, by injection, by means of a catheter, by means of a suppository, or by 
means of a skin patch or implant, said implant being of a porous, non-porous, or 

3 0 gelatinous material, including membranes, such as sialastic membranes, or fibers. 

This invention also provides pharmaceutical compositions. Such 
compositions comprise a therapeutically (or prophylactically) effective amount of 
the SH3 inhibitor, and a pharmaceutical^ acceptable carrier or excipient. Such a 
carrier includes but is not limited to saline, buffered saline, dextrose, water, 
3 5 glycerol, ethanol, and combinations thereof. The carrier and composition can be 
sterile. The formulation should suit the mode of administration. 

The composition, if desired, can also contain minor amounts of wetting or 
emulsifying agents, or pH buffering agents. The composition can be a liquid 
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solution, suspension, emulsion, tablet, pill, capsule, sustained release formulation, 
or powder. The composition can be formulated as a suppository, with traditional 
binders and carriers such as triglycerides. Oral formulation can include standard 
carriers such as pharmaceutical grades of mannitol, lactose, starch, magnesium 
5 stearate, sodium saccharine, cellulose, magnesium carbonate, etc. 

In a specific embodiment, the composition is formulated in accordance with 
routine procedures as a pharmaceutical composition adapted for intravenous 
administration to human beings. Typically, compositions for intravenous 
administration are solutions in sterile isotonic aqueous buffer. Where necessary, the 

1 0 composition may also include a solubilizing agent and a local anesthetic to ease 

pain at the side of the injection. Generally, the ingredients are supplied either 
separately or mixed together in unit dosage form, for example, as a lyophilized 
powder or water free concentrate in a hermetically sealed container such as an 
ampoule or sachette indicating the quantity of active agent. Where the composition 
15 is to be administered by infusion, it can be dispensed with an infusion bottle 

containing sterile pharmaceutical grade water or saline. Where the composition is 
administered by injection, an ampoule of sterile water for injection or saline can be 
provided so that the ingredients may be mixed prior to administration. 

Administration to an individual of an effective amount of the SH3 inhibitor 

2 0 can also be accomplished topically by administering the compound(s) directly to 

the affected area of the skin of the individual. For this purpose, the SH3 inhibitor is 
administered or applied in a composition including a pharmacologically acceptable 
topical carrier, such as a gel, an ointment, a lotion, or a cream, which includes, 
without limitation, such carriers as water, glycerol, alcohol, propylene glycol, fatty 

2 5 alcohols, triglycerides, fatty acid esters, or mineral oils. 

Other topical carriers include liquid petroleum, isopropyl palmitate, 
polyethylene glycol, ethanol (95%), polyoxyethylene monolaurate (5%) in water, or 
sodium lauryl sulfate (5%) in water. Other materials such as anti-oxidants, 
humectants, viscosity stabilizers, and similar agents may be added as necessary. 

3 0 .In addition, in certain instances, it is expected that the SH3 inhibitor may be 

disposed within devices placed upon, in, or under the skin. Such devices include 
patches, implants, and injections which release the compound into the skin, by 
either passive or active release mechanisms. 

Materials and methods for producing the various formulations are well 
3 5 known in the art [see e.g. US Patent Nos. 5,182,293 and 4,837,311 (tablets, 
capsules and other oral formulations as well as intravenous formulations)]. 

The effective dose of the SH3 inhibitor will typically be in the range of about 
0.01 to about 50 mg/kgs, preferably about 0.1 to about 10 mg/kg of mammalian 
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body weight, administered in single or multiple doses. Generally, the SH3 inhibitor 
may be administered to patients in need of such treatment in a daily dose range of 
about 1 to about 2000 mg per patient. 

The amount of the SH3 inhibitor which will be effective in the treatment or 
5 prevention of a particular disorder or condition will depend on the nature of the 
disorder or condition, and can be determined by standard clinical techniques. In 
addition, in vitro or in vivo assays may optionally be employed to help identify 
optimal dosage ranges. Effective doses may be extrapolated from dose-response 
curves derived from in vitro or animal model test systems. The precise dosage level 
1 0 of the SH3 inhibitor, as the active component(s), should be determined by the 

attending physician or other health care provider and will depend upon well known 
factors, including route of administration, and the age, body weight, sex and 
general health of the individual; the nature, severity and clinical stage of the 
disease; and the use (or not) of concomitant therapies. 

1 5 The invention also provides a pharmaceutical pack or kit comprising one or 

more containers filled with one or more of the ingredients of the pharmaceutical 
compositions of the invention. Optionally associated with such container(s) can be 
a notice in the form prescribed by a governmental agency regulating the 
manufacture, use or sale of pharmaceutical or biological products, which notice 

2 0 reflects approval by the agency of manufacture, use or sale for human 

administration. 

Pulmonary Administration 

In one embodiment of this invention, the SH3 inhibitor is administered by 

2 5 pulmonary administration, e.g. via aerosolization. This route of administration may 

be particularly useful for treatment or prophylaxis of bronchial or pulmonary 
infection or tumors. 

Pulmonary administration can be accomplished, for example, using any of 
various delivery devices known in the art (see e.g., Newman, S.P., 1984, in Aerosols 

3 0 and the Lung , Clarke and Davia (eds.), Butterworths, London, England, pp. 197- 

224; PCT Publication No. WO 92/16192 dated October 1, 1992; PCT Publication 
No. WO 91/08760 dated June 27, 1991; NTIS Patent Application 7-504-047 filed 
April 3, 1990 by Roosdorp and Crystal), including but not limited to nebulizers, 
metered dose inhalers, and powder inhalers. Various delivery devices are 
3 5 commercially available and can be employed, e.g., Ultravent nebulizer 

(Mallinckrodt, Inc., St. Louis, Missouri); Acorn II nebulizer (Marquest Medical 
Products, Englewood, Colorado), Ventolin metered dose inhaler (Glaxo Inc., 
Research Triangle Park, North Carolina); Spinhaler powder inhaler (Fisons Corp., 
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Bedford, Massachusetts) or Turbohaler (Astra). Such devices typically entail the 
use of formulations suitable for dispensing from such a device, in which a 
propellant material may be present. 

Ultrasonic nebulizers tend to be more efficient than jet nebulizers in 
5 producing an aerosol of respirable size from a liquid (Smith and Spino, 

"Pharmacokinetics of Drugs in Cystic Fibrosis," Consensus Conference, Clinical 
Outcomes for Evaluation of New CF Therapies, Rockville, Maryland, December 10- 
11, 1992, Cystic Fibrosis Foundation). 

A nebulizer may be used to produce aerosol particles, or any of various 
1 0 physiologically acceptable inert gases may be used as an aerosolizing agent. Other 
components such as physiologically acceptable surfactants (e.g., glycerides), 
excipients {e.g., lactose), carriers, and diluents may also be included. 

This invention is not to be limited in scope by the specific embodiments 
described herein. Indeed, various modifications of the invention in addition to those 

1 5 described herein will become apparent to those skilled in the art from the foregoing 

description. Such modifications are intended to fall within the the scope of the 
appended claims. 

Various patents, patent applications and publications are cited herein, the 
disclosures of which are incorporated by reference in their entireties. 

20 

Examples 

EXAMPLE 1 Phage Display Library Construction 

2 5 Reagents and Strains. T4 DNA ligase and T4 DNA polymerase were from 

Biorad. DNA sequencing was performed using a USB Sequenase kit as described by 
the manufacturer. Glutathione Sepharose 4B was purchased from Pharmacia. 
Oligonucleotides were synthesized with an Applied Biosystems xxx and purified 
using OPC columns. Peptides were synthesized using an Applied Biosystems synthe- 

3 0 sizer and purified by HPLC. Panning was performed using Costar 24 well (#3424) 

polystyrene dishes. 

Phage display library construction. M13 bacteriophage, containing an amber 
triplet (TAG) at position 20 of gene III DNA, was propagated in SupE E. coli host 
JM101. Single stranded DNA was prepared as described (Zoller and Smith, Meth. 
3 5 EnzymoL, 154:329 (1982)). Oligonucleotides were designed such that inserted 

sequences would be placed after position 20 of gene EQ. The oligonucleotides were 
also designed such that recombinants would lack the gene III amber codon. The E. coli 
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strain JS5, which does not suppress amber mutations, was used as the host to select 
for recombinants. 

Phosphorylation of oligonucleotide, annealing, synthesis of complementary 
DNA and electroporation was as described (Sambrook et al., Mol. CoL (Laboratory 
5 Manual) CSHL (1989); Zoller and Smith Supra). Four phage display libraries were 
prepared. The oligonucleotides used for the synthesis of the libraries were: 

First library (X 6 ) (SEQ ID NO: 160) 

(S' Haaacaacmcaacagt cccKNN CNN CNN CNN CNN CNN 
1 0 ccca gcggagtgagaata (3') (SEQ ID NO.:161) 

Second library (X 6 PPIP) (SEQ ID NO: 3) 

(S^gttttgctaaacaactttcaacagtcccgggtattggtgg KNN KNN KNN KNN KNN KNN 
ttcagcggagtgagaatagaaagg (3') (SEQ ID NO.:162) 

1 5 

Third library (RSLRPLX 6 ) (SEQ ID NO: 65) 
(5') gttttgctaaacaactttraara gt YNN YNN YNN YNN YNN YNN 
caaggggcggagteagcgttcagcggagtgagaatagaaaggffi') (SEQ ID NO.:163) 

2 0 Fourth library (PPPYPPX 6 ) (SEQ ID NO: 100) 

(5')gttttgctaa acaactttcaacagt YNN YNN YNN YNN YNN YNN 
c gg t ggatatggtggtggttcagcggagtgagaatagaaagg(3') (SEQ ID NO.:164) 

Underlined nucleotides represent complementary M13 sequences. Sequences 

2 5 inserted into gene HI are not underlined. Nucleotides used to create random amino 

acid sequences are denoted N, K and Y (N=A,G,C+T; K=G+T; Y=C+A). The other 
gene Hi-inserted sequences are described in the text. 

Amplification 

3 0 Bacteriophage were amplified in 500 mis of 2XYT media containing 10 mis of 

stationary phase JS5 cells at 37°C, 250 rpm for 6 hrs. The supernatant was cleared 
twice by centrifugation to remove bacteria (4000 rpm for 15 min in a Sorvall H4000 
rotor at 4°C) and phage precipitated by adjusting the supernatant to 0.5M NaCl/4% 
PEG. 

35 

EXAMPLE 2 Isolation of Phage with Affinity for Selected SH3 Domains 

A. Affinity Selection. Phage containing peptides with affinity for SH3 
domains were isolated by a method known as panning (Parmley and Smith, Gene, 
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73:305-318 (1988)). Wells (24 dishes) were coated first using 10 \ig of GST or GST- 
SH3 protein in 100 mM NaC03, pH 8.5 o/n at 4°C and then for 30 min at room tem- 
perature using a 20 mg/ml BSA (in 100 mM NaCC>3, pH 8.5) blocking buffer. Phage 
(1.5 X 10 11 infectious particles) were panned in 250 |il of 50 mM Tris HC1, pH 
5 7.5/150 mM NaCl containing 0.5% BSA and 0.05% Tween 20 o/n at 4°C. Wells were 
washed 3 X using TBS/0.5% BSA/0.05% Tween 20 at room temperature. Bound 
phage were eluted using 200 (il of 0.2M Glycine, pH 2.1/0.5% BSA/0.05% Tween 20 
and neutralized using 50 |il 1M Tris base. Eluted phage used again for panning were 
first amplified in 20 mis 2XYT media containing 0.4 mis of stationary phase JS5 cells 
1 0 and purified /concentrated as described above. 

B. Isolation and Sequencing of Phage Selected by the Src SH3 Domain. 

Degenerate oligonucleotides were designed so that a phage library could be 
prepared containing six residues of random amino acid sequences inserted into the 
bacteriophage M13 gene HI polypeptide. These random amino acid sequences were 

1 5 inserted into gene IE, bracketed by glycine residues, so that the polypeptide might be 

flexible and presented more efficiently as part of a gene III fusion. A library of 
2.5x10^ recombinants were generated. 

GST and GST-Src SH3 were each immobilized on polystyrene and binding 
selection was performed. After six cycles of selection/amplification, GST Src SH3- 

2 0 dependent phage enrichment was observed; the phage titer was 100-fold higher with 

GST-Src SH3, compared to a binding assay using GST. After eight cycles 
(approximately 1000 fold enrichment), ten bacteriophage were isolated and se- 
quenced. Each phage had the same nucleotide sequence, encoding a proline-rich 
peptide insert. In addition, a point mutation in flanking sequence (GGG->AGG) had 

2 5 occurred, converting a glycine residue to arginine: 

-GXXXXXXG- (SEQ ID NO.: 157) starting library 

--RSLPPIPG- (SEQ ID NO.: 158) GST-Src SH3 selected 

The phage displayed Src SH3 selected peptide is interesting as it has some 
similarity to part of a protein, termed 3BP1, identified by Cicchetti et al., who 

3 0 screened a Igtll cDNA expression library using the SH3 domain of Abl. This group 

also found that Src's SH3 domain could bind to 3BP1. The region of 3BP1 sequence 
important for SH3 binding is proline rich, as is the peptide identified by phage 
display. 

Curiously, the isolation of phage containing the proline-rich sequence required 
3 5 six cycles of binding selection. Phage enrichment is usually observed after 2-3 cycles. 
That six cycles were required, and only phage with the same nucleotide sequence 
identified, suggests that the phage selected by GST-Src SH3 were not present in the 
original library used for screening but instead generated during binding selection. The 



WO 95/24419 £ -j PCT7US95/03208 

single point mutation, which changed the glycine residue to arginine, probably 
occurred during the amplification of phage and was important for Src SH3 
recognition. These results support the view that the Src SH3 recognition sequence is 
proline-rich, at least seven amino acid residues in length and perhaps preferring an 
5 arginine residue as part of the recognition sequence. 

To expand our understanding of Src SH3 domain binding preferences, a new 
phage display library was prepared (2X10 9 recombinants) that contained six random 
amino acid residues inserted into gene III, adjacent to the proline-rich sequences 
identified in the first screen: -XXXXXXPPIP-- (SEQ ID NO: 3), referred to as the 
1 0 XgPPIP sequence. 

The X 6 PPIP library (SEQ ID NO: 3) was screened, using the Src SH3 domain 
(as a GST fusion) and with GST. GST-Src SH3 retained lOx more phage than GST 
after one cycle of selection suggesting the enrichment of certain phage. After three 
cycles of binding selection (approximately a 1000 fold enrichment) phage were 

1 5 isolated and sequenced. The amino acid sequences (SEQ ID NOS: 4 to 64) identified 

by screening the library using the Src SH3 domain are shown in Figure 1. 

With respect to the six randomized residues upstream of PPIP, at positions 1, 
2 and 3, no particular amino acid preferences were observed. In contrast, at position 
4; arginine was preferred, at position 5; proline and at position 6; leucine. That 

2 0 arginine is preferred at the fourth position is consistent with the notion that the point 

mutation observed with the isolate from the first library (SEQ ID NO: 2) (a glycine to 
arginine change) was important for Src SH3 selection. We also note that the inclusion 
of the amino acids PPIP as part of the gene m displayed residues probably is 
important for Src SH3 recognition as phage enrichment was observed after only one 

2 5 round of panning. 

To further explore SH3 binding preferences, the Xg PPIP phage display library 
(SEQ ID NO: 3) was also screened using the Fyn, Lyn, PI3K, Abl domains (as GST 
fusions) or using Grb-2 protein (which contains two SH3 domains). Four distinct 
ligand classes were identified (see Figure 1): 

3 0 Src, Fyn SH3 domains - XXXRPL(PPIP) (SEQ ID NO: 165) 

Lyn, PI3K SH3 domains - RXXRPL(PPIP) (SEQ ID NO: 166) 
Abl SH3 domain - PPPYPP(PPIP) (SEQ ID NO: 167) 

Grb-2 SH3 domains - RLYRPL(PPIP) (SEQ ID NO: 168) 

To more precisely define the ligand preferences of the SH3 domains listed 
3 5 above, three new libraries were prepared; RSLRPLXXXXXX, to further define Src, Fyn, 
Lyn and PI3K SH3 binding preferences (SEQ ID NO.: 65); PPPYPPXXXXXX, to 
further define Abl SH3 binding preferences (SEQ ID NO.: 100) and RLYRPLXXXXXX, 
to further define Grb-2 SH3 binding preferences (SEQ ID NO.: 109). Bacteriophage 
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were selected from these new libraries using GST-SH3 fusions or Grb-2, and after four 
rounds of panning, phage were isolated and sequenced. Again, specific binding 
preferences were observed (Figure 2). 

Src, Lyn SH3 domains -(RSLRPL)PPLPXP (SEQ ID NO: 169) 
5 PI3K SH3 domain - (RSLRPL)PPLPPP (SEQ ID NO: 170) 

Grb-2 SH3 domains - (RLYRPL)PPLPXX (SEQ ID NO: 171) 
Abl SH3 domain - (PPPYPP)PPI/VPXX (SEQ ID NOS: 172 and 173) 
Fyn SH3 domain - (RSLRPL)PPI/VPXX (SEQ ID NOS: 174 and 175) 
Randomizing six amino acid residues at a time, we have identified specific 
1 0 peptide ligands recognized by the Src, Fyn, Lyn, PI3K, Abl and Grb-2 SH3 domains. 
With the above studies, we have randomized a "window of SH3 binding recognition" 
that totals twelve residues in length. To determine whether additional (flanking) 
sequences contribute to binding, two new libraries were prepared that were designed 
to be useful for studying Src, Fyn, Lyn and PI3K SH3 binding preferences. These new 

1 5 phage displayed peptides include "core sequences", essentially a consensus sequence 

derived from the results of the screens described above. In addition, five randomized 
residues were placed either upstream (--XXXXXRPLPPLPPP--, SEQ ID NO.: 133) or 
downstream (-RSLRPLPPLPXXXXX— , SEQ ID NO.: 114) of the core sequences. These 
two libraries were screened to identify additional SH3 binding peptides using the Src, 

2 0 Lyn and PI3K SH3 domains. Specific binding preferences were observed. The selected 

sequences are shown in Figure 4A and Figure 4B. 

To determine whether these new sequences are important determinants for 
specific binding, two phage-displayed peptides, isolated from the X5RPLPPLPPP 
(SEQ ID NO: 133) library using the Lyn and Src SH3 domains (Lyn selected: - 

2 5 PWWLDRPLPPLPPP-, (SEQ ID NO.: 141) and VPLARRPLPPLPPP--, (SEQ ID 

NO.: 149)) were subjected to BIAcore® analysis (see above). GST-Lyn and GST-Src 
SH3 domains were immobilized on a Biosensor chip and the interaction between SH3 
domain and phage-displayed peptides were measured (Figure 5). Binding selectivity 
was clearly observed. Phage selected using the Lyn SH3 domain bound more effi- 

3 0 ciently to the Lyn SH3 domain and the phage selected using the Src SH3 domain 

bound more efficiently to the Src SH3 domain. 

C. Isolation and Sequencing of Phage Selected by the Abl SH3 Domain. 

They X 6 PPIP (SEQ ID NO: 3) phage display library was also used to more 
3 5 precisely define Abl SH3 binding preferences. As shown in Figure 3, the sequences 
selected by the Abl SH3 domain are distinct from those identified by the Src SH3 
domain. Because the Abl selected sequences are distinct, a new library, PPPYPPX5 

(SEQ ID NO: 100) was prepared and screened to identify more precisely the ligands 



WO 95/24419 



PCT/US95/03208 



preferred by the Abl SH3 domain. The results of the PPPYPPX 6 library screen are 
shown in Figure 2 (SEQ ID NO: 101 to 108). 

D . Summary of Results 
5 We have constructed phage display libraries to generate a rich source of 

structurally diverse peptides to identify high affinity ligands for the Src, Fyn, Lyn, 
PI3K, Grb-2 and Abl SH3 domains. We observe that although peptides selected by 
these SH3 domains are proline-rich, they are distinct. In this study, inroads towards 
understanding SH3 binding specificity was first obtained with the fortuitous isolation 
10 of a Src SH3 selected phage displayed peptide and the finding that the Src SH3 

recognition sequence was proline-rich and at minimum seven residues in length. This 
information was important for the design of a phage display library XXXXXXPPIP, 
(SEQ ID NO.: 3) that when screened, yielded insight as to Src, Fyn, Lyn, PI3K, Grb-2 
and Abl SH3 binding preferences (Figure 1). We continued this strategy by preparing 

1 5 and screening additional phage display libraries that contained either five or six 

random residues adjacent to sequences that helped to constitute SH3 recognition se- 
quences. This approach has led to the identification of distinct SH3 binding 
preferences. Based on information obtained from screening seven phage display 
libraries, we have denned what we term the SH3 ligand core (Figure 1 and 2) and 

2 0 SH3 ligand flanking sequences (Figures 4A and 4B). 

The Src, Fyn, Lyn, PI3K, Abl and Grb-2 SH3 ligand core sequences were 
defined using the libraries XXXXXXPPIP (SEQ ID NO.: 3), RSLRPLXXXXXX (SEQ ID 
NO.: 65), PPPYPPXXXXXX (SEQ ID NO.: 100) and RLYRPLXXXXXX (SEQ ID NO.: 
109). That specific flanking sequences could be identified was demonstrated using the 

2 5 Src, Lyn and PI3K SH3 domains and the libraries XXXXXRPLPPLPPP (SEQ ID NO.: 

133) and RSLRPLPPLPXXXXX (SEQ ID NO.: 114). The claim of SH3 specific ligands 
is based on the unique sequences identified for a particular SH3 binding domain 
(Figures 1, 2, 4A and 4B) as well as the specificity observed with respect to binding 
affinity (Figures 3 and 5). 

3 0 The core peptides identified in this report define the minimal sequence 

requirements for efficient recognition by the Src, Fyn, Lyn, PI3K, Grb-2 and Abl SH3 
domains. When using Src, Lyn or PI3K SH3 domains to screen phage display libraries 
that containing random residues either upstream or downstream of core residues, we 
observe additional amino acid preferences (see figures). 
3 5 The single letter amino acid codes used herein are standard and represent 

amino acids including the following: 
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R - Arginine; 
Y = Tyrosine; 
A = Alanine; 



S = Serine; 

G = Glycine; 

W = Tryptophan; 



L = Leucine; 

I = Isoleucine; 

X = any amino acid. 



P = Proline; 
V = Valine; 



5 Example 3 Assessment of Binding of Phage-Selected Peptides to Src and Abl SH3 



The Src and Abl SH3 domains (Src amino acids 81-140 and Abl amino acids 
84-138) were expressed as carboxy-terminal fusions to glutathione-S-transferase 
(amino acids 1-226 (Smith, D.B. and Johnson, K.S., Gene 67:31-40 (1988)) in 
1 0 Escherichia coli and purified to >95% homogeneity by glutathione affinity chroma- 
tography. GST-Src and GST-AW fusion proteins were covalently immobilized via 
solvent-exposed lysines to a BIAcore Biosensor Chip CM-5 (Pharmacia Biosensor, 
Piscataway, NJ) using standard N-ethyl-N'-(3-dimethylaminopropyl)-carbodiimide 
hydrochloride/N-hydroxysuccinimide chemistry (O'Shannessy, D.J. et al., Analyt. 

1 5 Biochem. 205:132-136 (1992)). The resulting GST-Src and GST-Abl biosensor chips 

were docked into a BIAcore Biosensor (Pharmacia Biosensor, Piscataway, NJ) and 
the biospecific interaction between immobilized SH3 domain and phage-displayed 
peptides measured by surface plasmon resonance (Malmqvist, M, Nature 361:186- 
187 (1993); Jonsson, U. and Malmqvist, M., Advances in Biosensors, JAI Press Ltd., 

2 0 London, 1992, pp. 291-336; Jonsson, U. et al., BioTechniques ll(5):620-627 (1991)). 

Interaction between receptor (SH3) and ligand (phage-displayed peptide) is indicat- 
ed by an increase in RU value (positive inflection in the plane of the ordinate). 
Assays were run at 25°C in phosphate buffered saline, pH 7.4, containing 0.05% Sur- 
factant P20 ((Pharmacia Biosensor, Piscataway, NJ) at a flow rate of 5 \iL min" 1 . 

2 5 10 13 pfu/ml of Src-selected phage displaying the peptide sequence RSSRPLPPIP 

(SEQ ID NO.: 35 or AbZ-selected phage displaying the peptide sequence PPPYPPPPIP 
(SEQ ID NO.: 49) were used to measure binding. 

Results of these assays are represented graphically in Figures 3A-3D. As is 
evident, Src-selected phage bound specifically to immobilized GST-Src SH3 (Figure 

3 0 3 A) and did not bind to immobilized GST-Abl SH3 (Figure 3B). As also shown, Abl- 

selected phage bound specifically to immobilized GST-AW SH3 (Figure 3D) and did 
not bind to immobilized GST-Src SH3 (Figure 3C). Thus, these results demonstrate the 
binding specificity and selectivity of a ligand to selected SH3 domain. Specifically, 
the results show that a peptide ligand selected on the basis of its ability to bind Src 
3 5 binds Src SH3 but not Abl SH3 and, conversely, that a peptide ligand selected on the 
basis of its ability to bind Abl binds Abl SH3 but not Src SH3. 



Domains 
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EXAMPLE 4 Association of Src Protein With Tyrosine Phosphorylated Proteins in 

vivo 

Several proteins have been reported to coprecipitate with v-Src from extracts 
of v-Src-transformed cells (Kanner, S.B. et al., EMBO J. 10:1689-1698 (1991); Koch, 
5 C.A., Mol. Cell. Biol. 12:1366-1374 (1992). To further analyze the involvement of the 
SH3 domain in the association of these proteins with Src, we examined the ability of 
the Src SH3-binding peptides to inhibit the coprecipitation of Src with tyrosine 
phosphorylated proteins. We also examined the ability of a high affinity Src SH2- 
binding peptide (EPQpYEEIPI (SEQ ID NO: 176), denoted here as YEEI) to block 
1 0 SH2 domain-dependent interactions. In this assay, Balb/c 3T3 and SRD-3T3 cells 
(V-Src transformed 3T3 cells) were lysed in RIPA buffer and the Src protein was 
immunoprecipitated with a monoclonal antibody to the amino-terminus of Src in the 
presence or absence of the SH2 and SH3 binding peptides. For comparison, the SH3 
binding proteins were also isolated by affinity isolation using GST-Src-SH3. The 

1 5 samples were transferred to nitrocellulose, and the blot was probed with a 

monoclonal antibody to phosphotyrosine. Two major tyrosine phosphorylated 
proteins co-migrated with v-Src from SRD 3T3 cells, one of which co-migrated with 
the 62 kDa tyrosine-phosphorylated protein that bound to the Src SH3 domain, and 
the other protein of Mr 130,000, which is most likely the 130,000 kDa protein 

2 0 previously detected in v-Src immunoprecipitates. (Koch, C.A., Mol. Cell. Biol. 

12:1366-1374 (1992)). 

The binding of Src to the 62 kDa protein was reduced significantly by 
incubation of the extracts with the "Src-pro" peptide or the phosphorylated YEEI 
peptide, but not the poly-proline nor the unphosphorylated YEEI peptides, suggesting 

2 5 that both the SH2 and SH3 domains of Src are required for stable association of this 

protein with Src. We have been unable to determine whether this protein is the p62 
protein identified in the total cell SH3 binding proteins since the p62 antibody is not 
sensitive enough to recognize the small amounts of the 62 kDa protein that associate 
with Src under these conditions. The "Src-pro" peptide and the pYEEI peptide could 

3 0 only block 50% of pl30 binding to Src, whereas 90% inhibition of pl30 binding was 

observed when both peptides were used, suggesting that either the SH2 or the SH3 
domain is sufficient for some degree of pl30 association with Src. 

EXAMPLE 5 

3 5 SSLs may be identified based on their ability to inhibit formation of an 

SH3/ligand complex. A known ligand, preferably a peptide, is immobilized to the 
surface of the sensor material or chip as defined for the instrument(e.g. seeBIAcore® 
System Manual BIAcore Biosensor AB, Uppsalla, Sweden, 1994) using known 
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chemistries (BIAapplications Handbook, BIAcore Biosensor AB, Uppsalla, Sweden, 
1994 ). The ability of a potential SSL to bind to a known SH3 domain is inferred 
from its ability to inhibit the formation of SH3 domain/ligand complex as measured 
in the presence and absence of various concentrations of SSLs using surface plasmon 
5 resonance technologies. 

For example, 360 resonance units (RU) of a Grb-2 SH3-ligand (Ac- 
EVPWPPVPPRRRPGGK-NH 2 ) were covalently immobilized to a Biosensor Chip 
CM5 (Pharmacia Biosensor) through the e-amino group of the lysyl residue. 
Immobilizations were performed at 25°C with a flow rate of 5 ul min -1 in HBS 
1 0 running buffer (10 mM HEPES pH 7.4, 150 mM NaCl, 3.4 mM EDTA, 0.05% 

Surfactant P20 (Pharmacia Biosensor)) as follows: 30 uL injection of 0.2 M l-ethyl-3- 
(3-dimethylammopropyl)-carbodiimide hydrochloride / 0.05 M N-hydroxy- 
succinimide to create reactive linkers; 30 uL injection 0.1 mM peptide in HBS to 
permit covalent linking; 30 uL injection of 1.0 M ethanolamine hydrochloride for 5 

1 5 minutes to block unreacted N-hydroxy-succinimide esters. To remove non-covalently 

bound peptide and stabilize the hydrogel, the chip was cycled through ten 
consecutive regeneration loops consisting of 2 minutes exposure to 6 M guanidine HC1 
(pH 7.0); 5 minutes exposure to HBS (flow rate of 5 uL mkr 1 ). Identical wash cycles 
were used to regenerate the chip between sensorgram. 

2 0 Assays were initiated by combining luM Grb-2(P204L) SH3 domain with 

various concentrations (100, 50, 25, 10, 5, 2.5, 1 or 0 uM) of SOS-derived peptide 
(Ac-EVPVPPPVPPRRRPGGK-NH2) or test peptide (Ac-SHRLYRPLPPLPGGK- 
NH2) and pre-incubated at 25°C for >30 minutes. Assays were performed in HBS at 
25°C with a flow rate of 3 ul min" 1 in PBS/imidazole (1.2 mM KH2P04, 8.1 mM 

2 5 Na2HPO4.7H20, 138 mM NaCl, 0.5 mM MgC12.6H20, 2.7 mM KC1, 0.9 mM CaC12, 

80 mM immidazole, pH 7.4). Complex formed between the SH3 domain and the 
immobilized ligand was measured using the BIAcore Biosensor® (Pharmacia 
Biosensor, Piscataway, NJ) for each concentration of competitor and the IC50 
calculated. See Figure ZX. 

3 0 Alternatively, SSLs may be identified based on their ability to bind to SH3 

domains, again, as measured using BIAcore® and allied technologies. For example, an 
SSL (or SH3 domain) is immobilized to the surface of the sensor material or chip as 
defined for the instruments^. seeBIAcore® System Manual, BIAcore Biosensor AB, 
Uppsalla, Sweden, 1994) using known chemistries (BIAapplications Handbook, 
3 5 BIAcore Biosensor AB, Uppsalla, Sweden, 1994 ) as above. The ability of the 

immobilized SSL (or SH3 domain) to bind to an SH3 domain (or SSL) is directly 
measured using surface plasmon resonance technologies. See Rickles et al, EMBO 
supra. 
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Claims 



1. A method for identifying a peptide comprising an amino acid sequence not 
present in a naturally occurring protein, which peptide is a ligand for a protein of 
interest containing at least one SH3 domain, the method comprising: 

(a) contacting the SH3 domain with a mixture of one or more candidate 
peptides under conditions permitting a ligand to bind to an SH3 domain to 
form an SH3 domain-ligand complex, 

(b) separating unbound peptides from any SH3 domain-ligand complexes so 
formed, 

(c) dissociating the complexed peptide Iigand(s) from the SH3 domain-ligand 
complexes, 

(d) enriching for the selected peptide(s) by re-contacting the dissociated 
peptides with the SH3 domain under conditions permitting a ligand to bind to 
an SH3 domain to form a complex, and, 

(e) detecting which if any candidate ligands bound to the SH3 domain. 
A method of claim 1 which further comprises 

(a) dissociating any candidate peptide ligands from SH3-ligand complexes 
formed during the enrichment step, and 

(b) identifying the amino acid sequences of the dissociated peptide ligands. 

A method for identifying a peptide comprising an amino acid sequence not 
present in a naturally occurring protein, which peptide is a ligand for a protein of 
interest containing at least one SH3 domain, other than Abl or phosphatidylinositol 
3-kinase, which method comprises: 

(a) combining an SH3 domain of interest with one or more candidate peptide 
ligands under conditions permitting a ligand to bind to an SH3 domain to 
form an SH3-ligand complex, 



3 



(b) detecting which if any candidate ligands bound to the SH3 domain. 
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4. A method for identifying a peptide ligand for a protein of interest containing 
at least one SH3 domain, other than Abl, Src, Grb2, Nek or phosphatidylinositol 3- 
kinase which method comprises: 

(a) combining an SH3 domain of interest with one or more candidate peptide 
ligands under conditions permitting a ligand to bind to an SH3 domain to 
form an SH3-ligand complex, 

(b) detecting which if any candidate ligands bound to the SH3 domain. 

5 . A method of any of claims 1 through 4 wherein the SH3 domain is provided in 
the form of a fusion protein. 

6. A method of any of claims 1 through 5 wherein the mixture of peptides is 
provided in the form of a library of synthetic peptides or in the form of a phage 
library displaying the various peptides. 

7. A method of any of claims 1 through 6 which further comprises contacting the 
ligand so identified with an SH3 domain from a second protein tinder conditions 
permitting the formation of an SH3 domain-ligand complex and determining the 
presence or extent of binding of the ligand to the SH3 domain from the second 
protein. 

8. A peptide ligand for an SH3 domain of interest identified by the method of 
any of claims 1 through 7. 

9 . A peptide ligand for an SH3 domain of interest comprising an amino acid 
sequence not present in a naturally occurring protein, which was identified by the 
method of any of claims 1 through 7. 

10. A peptide ligand specific for an SH3 domain of interest, identified by the 
method of of any of claims 1 through 7, which binds to the SH3 domain against 
which it was selected with a Kd value of less than half of the Kd with which it binds 
to a different SH3 domain.. 

11. A peptide ligand of claim 8, 9 or 10 containing at least seven amino acid 
residues. 

12. A peptide ligand of claim 11 containing at least 10 amino acid residues. 



13. A peptide ligand of claim 12 containing at least 12 amino acid residues. 
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14. A peptide ligand of claim 13 containing at least 14 amino acid residues. 

15. A peptide ligand of claim 14 containing at least 15 amino acid residues. 

16. A peptide comprising an amino acid sequence not present in a naturally 
occurring protein, which peptide is a ligand capable of binding selectively to an SH3 
domain from src, fyn, lyn, yes, txk, tsk, btk, NADPH oxidase p47-phox or p67-phox, 
or crk. 

17. A peptide ligand capable of binding selectively to an SH3 domain from the 
group consisting of (1) members of the src-family protein tyrosine kinases (including 
Lyn, Lck, Hck, Fgr, and Yes), (2) Sprk, a threonine /serine protein kinase, (3) Tsk, (4) 
Btk, (5) Txk and other Tec family members, (6) Vav, (7) GTPase Activating Protein 
(GAP), (8) p40, p47, and p67 proteins of the neutrophil oxidase complex, (9) Crk, 
and (10) phospholipase C gamma. 

18. A DNA sequence comprising a nucleotide sequence encoding a peptide ligand 
of any of claims 8 through 17. 

19. A method for identifying a substance which inhibits the interaction of proteins 
mediated by an SH3 domain of interest, which method comprises: 

(a) providing (i) an SH3 domain of interest, (ii) a specific SH3 ligand (SSL) 
comprising a peptide of claim 8, 9, 10, 11, 12, 13, 14, 15, 16, or 17 which is 
capable of selectively binding to the SH3 domain of interest to form an SH3- 
SSL complex and (iii) a composition comprising a test substance to be 
evaluated; 

(b) incubating the SH3 domain and the SSL under conditions permitting the 
formation of an SH3-SSL complex, in the presence and absence of the test 
substance; and 

(c) measuring the ability of the test substance to inhibit the formation or 
persistence of the SH3-SSL complex. 

20. A method for identifying a substance which inhibits the interaction of proteins 
mediated by an SH3 domain of interest, which method comprises: 

(a) providing (i) an SH3 domain of interest, (ii) a specific SH3 ligand (SSL) 
which is capable of selectively binding to the SH3 domain of interest to form 
an SH3-SSL complex but comprises an amino acid sequence not present in a 
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naturally occurring protein and (iii) a composition comprising a test substance 
to be evaluated; 

(b) incubating the SH3 domain and the SSL under conditions permitting the 
formation of an SH3-SSL complex, in the presence and absence of the test 
substance; and 

(c) measuring the ability of the test substance to inhibit the formation or 
persistence of the SH3-SSL complex. 

21. A method for identifying a substance which inhibits the interaction of proteins 
mediated by an SH3 domain of interest, which method comprises: 

(a) providing (i) an SH3 domain of interest other than an SH3 domain from 
Grb-2 or Fyn, (ii) a specific SH3 ligand (SSL) which is capable of selectively 
binding to the SH3 domain of interest to form an SH3-SSL complex and (iii) a 
composition comprising a test substance to be evaluated; 

(b) incubating the SH3 domain and the SSL under conditions permitting the 
formation of an SH3-SSL complex, in the presence and absence of the test 
substance; and 

(c) measuring the ability of the test substance to inhibit the formation or 
persistence of the SH3-SSL complex. 

22. A method of claim 19, 20 or 21 in which the SH3 domain is provided in the 
form of a fusion protein containing the SH3 domain linked to heterologous peptide 
sequence. 

23. A method of claim 19, 20 or 21 in which the SSL is a peptide of at least 7 
amino acid residues. 

24. An inhibitor of protein-protein interactions mediated by an SH3 domain of 
interest identified by the method of any of claims 19 through 23. 

25. A genetically engineered cell containing and capable of expressing: 

(a) a reporter gene, 

(b) a first DNA sequence encoding a DNA-binding fusion protein containing a 
DNA-binding domain and a first SH3 binding partner, and 



WO 95/24419 



PCT/US95/03208 



(c) a second DNA sequence encoding a transcriptional activating fusion 
protein containing a transcriptional activating domain and a second SH3 
binding partner capable of binding to the first SH3 binding partner, 
wherein 

(i) the reporter gene is under the expression control of a DNA element to 
which the DNA-binding fusion protein is capable of binding, 

(ii) the transcriptional activating fusion protein is capable of binding to the 
DNA-binding fusion protein, and 

(iii) expression of the reporter gene is measurably activated under conditions 
permitting the binding of the DNA-binding fusion protein to the DNA element 
and to the transcriptional activating fusion protein. 

26 . A genetically engineered cell of claim 25, wherein: 

(a) the first SH3 binding partner is a peptide sequence comprising the amino 
acid sequence of an SH3 domain of interest, and the second SH3 binding 
partner is a peptide sequence comprising a specific ligand for the SH3 domain 
of interest; or 

(b) the first SH3 binding partner is a peptide sequence comprising a specific 
ligand for the SH3 domain of interest, and the second SH3 binding partner is 
a peptide sequence comprising the amino acid sequence of an SH3 domain of 
interest. 



27. A method for identifying an inhibitor of interactions between proteins 
mediated by one or more SH3 domains which comprises: 

(a) culturing cells of claim 25 or 26 in an appropriate culture medium 
permitting growth of the cells and under suitable conditions permitting 
detectable expression of the reporter gene, 

(b) adding a test material to the culture medium, and 

(c) identifying any decrease in the expression of the reporter gene. 

28. A method of claim 27 wherein the test material comprises a sample of: a 
microbial broth; cellular extract; conditioned medium from cell lines or from host cells 
transformed with genetic libraries; a collection of one or more synthetic compounds; 
or a combinatorial library of compounds. 



29. An inhibitor identified by the method of claim 27 or 28. 
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Figure 1A/14 
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CLASS II +5 
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